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Foreword to Earlier Series Editions 


More than a generation of German-speaking students around the world have worked 
their way to an understanding and appreciation of the power and beauty of modem 
theoretical physics - with mathematics, the most fundamental of sciences - using 
Walter Greiner’s textbooks as their guide. 

The idea of developing a coherent, complete presentation of an entire field 
of science in a series of closely related textbooks is not a new one. Many older 
physicists remember with real pleasure their sense of adventure and discovery 
as they worked their ways through the classic series by Sommerfeld, by Planck 
and by Landau and Lifshitz. From the students’ viewpoint, there are a great many 
obvious advantages to be gained through use of consistent notation, logical ordering 
of topics and coherence of presentation; beyond this, the complete coverage of 
the science provides a unique opportunity for the author to convey his personal 
enthusiasm and love for his subject. 

The present five-volume set, Theoretical Physics, is in fact only that part of 
the complete set of textbooks developed by Greiner and his students that presents 
the quantum theory. I have long urged him to make the remaining volumes on 
classical mechanics and dynamics, on electromagnetism, on nuclear and particle 
physics, and on special topics available to an English-speaking audience as well, 
and we can hope for these companion volumes covering all of theoretical physics 
some time in the future. 

What makes Greiner’s volumes of particular value to the student and professor 
alike is their completeness. Greiner avoids the all too common “it follows that ...” 
which conceals several pages of mathematical manipulation and confounds the 
student. He does not hesitate to include experimental data to illuminate or illustrate 
a theoretical point and these data, like the theoretical content, have been kept up to 
date and topical through frequent revision and expansion of the lecture notes upon 
which these volumes are based. 

Moreover, Greiner greatly increases the value of his presentation by including 
something like one hundred completely worked examples in each volume. Nothing 
is of greater importance to the student than seeing, in detail, how the theoretical 
concepts and tools under study are applied to actual problems of interest to a 
working physicist. And, finally, Greiner adds brief biographical sketches to each 
chapter covering the people responsible for the development of the theoretical ideas 
and/or the experimental data presented. It was Auguste Comte (1798-1857) in his 
Positive Philosophy who noted, “To understand a science it is necessary to know 
its history”. This is all too often forgotten in modem physics teaching and the 
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Foreword to Earlier Series Editions 


bridges that Greiner builds to the pioneering figures of our science upon whose 
work we build are welcome ones. 

Greiner’s lectures, which underlie these volumes, are internationally noted for 
their clarity, their completeness and for the effort that he has devoted to making 
physics an integral whole; his enthusiasm for his science is contagious and shines 
through almost every page. 

These volumes represent only a part of a unique and Herculean effort to make 
all of theoretical physics accessible to the interested student. Beyond that, they 
are of enormous value to the professional physicist and to all others working with 
quantum phenomena. Again and again the reader will find that, after dipping into a 
particular volume to review a specific topic, he will end up browsing, caught up by 
often fascinating new insights and developments with which he had not previously 
been familiar. 

Having used a number of Greiner’s volumes in their original German in my 
teaching and research at Yale, I welcome these new and revised English translations 
and would recommend them enthusiastically to anyone searching for a coherent 
overview of physics. 

Yale University D. Allan Bromley 

New Haven, CT, USA Henry Ford II Professor of Physics 

1989 



Preface to the Third Edition 


We are pleased to note that our text Relativistic Quantum Mechanics - Wave Equa- 
tions has found many friends among physics students and researchers so that the 
need for a third edition has arisen. We have taken this opportunity to make several 
amendments and improvements to the text. A number of misprints and minor errors 
have been corrected and explanatory remarks have been given at various places. 

We thank several colleagues and students for helpful comments. We also thank 
Dr. Stefan Hofmann who has supervised the preparation of the third edition of the 
book. Finally we acknowledge the agreeable collaboration with Dr. H. J. Kolsch 
and his team at Springer- Verlag, Heidelberg. 

Frankfurt am Main, Walter Greiner 

March 2000 



Preface to the Second Edition 


For its second edition the book Relativistic Quantum Mechanics - Wave Equations 
has undergone only minor revisions. A number of misprints and errors in a few 
equations have been corrected. Also the typographical appearance and layout of 
the book has been improved. I hope that the book will continue to be useful to 
students and teachers alike. 

I thank Markus Bleicher for his help and acknowledge the agreeable collabo- 
ration with Dr. H. J. Kolsch and his team at Springer- Verlag, Heidelberg. 

Frankfurt am Main, Walter Greiner 

March 1997 



Freface to the First Edition 


Theoretical physics has become a many-faceted science. For the young student it is 
difficult enough to cope with the overwhelming amount of new scientific material 
that has to be learned, let alone obtain an overview of the entire field, which 
ranges from mechanics through electrodynamics, quantum mechanics, field theory, 
nuclear and heavy-ion science, statistical mechanics, thermodynamics, and solid- 
state theory to elementary-particle physics. And this knowledge should be acquired 
in just 8-10 semesters, during which, in addition, a Diploma or Master’s thesis has 
to be worked on or examinations prepared for. All this can be achieved only if the 
university teachers help to introduce the student to the new disciplines as early on 
as possible, in order to create interest and excitement that in turn set free essential 
new energy. Naturally, all inessential material must simply be eliminated. 

At the Johann Wolfgang Goethe University in Frankfurt we therefore confront 
the student with theoretical physics immediately, in the first semester. Theoretical 
Mechanics I and II, Electrodynamics, and Quantum Mechanics I - An Introduction 
are the basic courses during the first two years. These lectures are supplemented 
with many mathematical explanations and much support material. After the fourth 
semester of studies, graduate work begins, and Quantum Mechanics II - Symme- 
tries, Statistical Mechanics and Thermodynamics, Relativistic Quantum Mechanics, 
Quantum Electrodynamics, the Gauge Theory of Weak Interactions, and Quantum 
Chromodynamics are obligatory. Apart from these a number of supplementary 
courses on special topics are offered, such as Hydrodynamics, Classical Field The- 
ory, Special and General Relativity, Many-Body Theories, Nuclear Models, Mod- 
els of Elementary Particles, and Solid-State Theory. Some of them, for example 
the two-semester courses Theoretical Nuclear Physics and Theoretical Solid-State 
Physics, are also obligatory. 

The form of the lectures that comprise Relativistic Quantum Mechanics - Wave 
Equations follows that of all the others: together with a broad presentation of the 
necessary mathematical tools, many examples and exercises are worked through. 
We try to offer science in as interesting a way as possible. With relativistic quantum 
mechanics we are dealing with a broad, yet beautiful, theme. Therefore we have 
had to restrict ourselves to relativistic wave equations. The selected material is 
perhaps unconventional, but corresponds, in our opinion, to the importance of this 
field in modem physics: 

The Klein-Gordon equation (for spin-0 particles) and the Dirac equation (for 
spin- 1 particles) and their applications constitute the backbone of these lectures. 
Wave equations for particles with higher spin (the Rarita-Schwinger, spin-|, Kem- 
mer and Proca, spin-1, and general Bargmann-Wigner equations) are confined to 
the last chapters. 
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After introducing the Klein-Gordon equation we discuss its properties and diffi- 
culties (especially with respect to the single-particle interpretation); the Feshbach- 
Villars representation is given. In many worked-out exercises and examples its 
practical applications can be found: pionic atoms as a modem field of research 
and the particularly challenging examples on the effective pion-nucleus potential 
(the Kisslinger potential) and its improvement by Ericson and Ericson stand in the 
foreground. 

Most of these lectures deal with Dirac’s theory. The covariance properties 
of the Dirac equation are discussed in detail. So, for example, its free solutions 
are on the one hand determined directly and on the other hand through Lorentz 
transformations from the simple solutions in the rest frame. Here the methodical 
issue is emphasized: the same physical phenomenon is illuminated from different 
angles. We proceed in a similar manner in the discussion of single-particle operators 
(the odd and even parts of an operator) and the so-called Zitterbewegung, which is 
also derived from the consideration of wave packets of plane Dirac waves. In many 
worked-out problems and examples the new tools are exercised. Thus the whole 
of Chap. 9 is dedicated to the motion of Dirac particles in external potentials. 
It contains simple potential problems, extensively the case of the electron in a 
Coulomb potential (the fine-structure formula), and muonic atoms. In Chap. 10 we 
present the two-centre Dirac equation, which is of importance in the modem field 
of heavy-ion atomic physics. The fundamental problem of overcritical fields and 
the decay of the electron-positron vacuum is only touched upon. A full treatment 
is reserved for Quantum Electrodynamics (Vol. 4 of this series). However, we give 
an extended discussion of hole theory and also of Klein’s paradox. The Weyl 
equation for the neutrino (Chap. 14) and relativistic wave equations for particles 
with arbitrary spin (Chap. 15) follow. Starting with the Bargmann-Wigner equations 
the general frame for these equations is set, and in numerous worked-out examples 
and exercises special cases (spin-1 particles with and without mass, and spin-| 
particles according to Rarita and Schwinger) are considered in greater detail. In 
the last chapter we give an overview of relativistic symmetry principles, which 
we enjoy from a superior point of view, since by now we have studied Quantum 
Mechanics - Symmetries (Vol. 2 of this series). 

We hope that in this way the lectures will become ever more complete and may 
lead to new insights. 

Biographical notes help to obtain an impression, however short, of the life 
and work of outstanding physicists and mathematicians. We gratefully acknowl- 
edge the publishers Harri Deutsch and F.A. Brockhaus ( Brockhaus Enzyklopadie, 
F.A. Brockhaus - Wiesbaden indicated by BR) for giving permission to use relevant 
information from their publications. 

Special thanks go to Prof. Dr. Gerhard Soff, Dr. Joachim Reinhardt, and Dr. 
David Vasak for their critical reading of the original draft of these lectures. Many 
students and collaborators have helped during the years to work out examples and 
exercises. For this first English edition we enjoyed the help of Maria Berenguer, 
Christian Borchert, Snjezana Butorac, Christian Derreth, Carsten Greiner, Kordt 
Griepenkerl, Christian Hofmann, Raffele Mattiello, Dieter Neubauer, Jochen Rau, 
Wolfgang Renner, Dirk Rischke, Alexander Scherdin, Thomas Schonfeld, and 
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XI 


Dr. Stefan Schramm. Miss Astrid Steidl drew the graphs and prepared the fig- 
ures. To all of them we express our sincere thanks. 

We would especially like to thank Mr. Bela Waldhauser, Dipl.-Phys., for his 
overall assistance. His organizational talent and his advice in technical matters are 
very much appreciated. 

Finally, we wish to thank Springer- Verlag; in particular, Dr. H.-U. Daniel, for 
his encouragement and patience, Mr. Michael Edmeades for expertly copy-editing 
the English edition, and Mr. R. Michels and his team for the excellent layout. 

Frankfurt am Main, Walter Greiner 

May 1990 
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1. Relativistic Wave Equation for Spin-0 Particles: 
The Klein-Gordon Equation and Its Applications 


The description of phenomena at high energies requires the investigation of rela- 
tivistic wave equations. This means equations which are invariant under Lorentz 
transformations. The transition from a nonrelativistic to a relativistic description 
implies that several concepts of the nonrelativistic theory have to be reinvestigated, 
in particular: 


(1) Spatial and temporal coordinates have to be treated equally within the theory. 

(2) Since 


Ap nioc 


a relativistic particle cannot be localized more accurately than « hjm§c\ oth- 
erwise pair creation occurs for E > Im^c 2 . Thus, the idea of a free parti- 
cle only makes sense if the particle is not confined by external constraints 
to a volume which is smaller than approximately the Compton wavelength 
A c = h/rriQC , Otherwise the particle automatically has companions due to par- 
ticle-antiparticle creation. 

(3) If the position of the particle is uncertain, i.e. if 

A h 
Ax > . 

niQC 


then the time is also uncertain, because 


At ~ 


Ax 

— > 
c 


h 

moc 2 


In a nonrelativistic theory At can become arbitrarily small, because c — * oo. 
Thereby, we recognize the necessity to reconsider the concept of probability 
density 


6(*,y,z,t) , 

which describes the probability of finding a particle at a definite place r at 
fixed time t. 

(4) At high (relativistic) energies pair creation and annihilation processes occur, 
usually in the form of creating particle-antiparticle pairs. Thus, at relativistic 
energies particle conservation is no longer a valid assumption. A relativistic 
theory must be able to describe pair creation, vacuum polarization, particle 
conversion, etc. 
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1. Relativistic Wave Equation for Spin-0 Particles 


i.i The Notation 


First we shall remark on the notation used. Until now we have expressed four- 
vectors by Minkowski’s notation, with an imaginary fourth component, as for 
example 


x = {x,y,z,ict} 

p = {pxiPyiPzAE/c} 

A = { A x , Ay , A z , iAo } 

^ _ f d a d d \ 

V ” 1 dx ’ dy ’ dz ’ i d(ct) J 


(world vector) , 
(four-momentum) , 
(four-potential) , 

(four-gradient), etc. 


( 1 . 1 ) 


The letters x , /?, A, V abbreviate the full four- vector. Sometimes we shall also 

denote them by x , P , A, V, etc., i.e. with a double arrow. As long as there is no 
confusion arising, we prefer the former notation. For the following it is useful to 
introduce the metric tensor ( covariant compontents) 


9oo 

9o\ 

502 

503 \ 


z 1 

0 

0 

°\ 

9io 

9u 

5l2 

513 


0 

-l 

0 

0 

920 

9ii 

522 

523 


1° 

0 

-l 

0 

V(?30 

531 

532 

933 / 

\o 

0 

0 

-1/ 


Thereby, one can denote the length of the vector dx = {dx^} as ds z = dx dx = 
dr^dr^. This relation is often taken as the defining relation of the metric 
tensor. 1 The contravariant form g ^ of the metric tensor follows from the condition 


def 


9^ 9a, = K = 


/I 0 0 0\ 
0 10 0 
0 0 10 
Vo o o 1/ 


<r = Or V = 


■‘/ACT 


/I 

0 

0 

Vo 


0 

-1 

0 

0 


0 

0 

-1 

0 


0 

-1 / 


(1.3) 


(1.4) 


Here zA /lcT is the cofactor of g )la [i.e. the subdeterminant, obtained by crossing out 
the /xth row and the <rth column and multiplying it with the phase (— 1) M+<T ] and 
g is given by g — dei(g in/ ) = -1. For the special Lorentz metric the contravariant 
and co variant metric tensor are identical: 


[for Lorentz metric!] . 

From now on we will use the contravariant four-vector 

* M = lx 0 ,* 1 ,* 2 ,* 3 } = {ct,x,y,z} (1.5) 

for the description of the space-time coordinates, where the time-like component 
is denoted as zero component. We get the covariant form of the four-vector by 
“lowering” the index p with the help of the metric tensor, i.e. 


1 We adopt the same notation as J.D. Bjorken, S D Drell: Relativistic Quantum Mechanics 

(McGraw Hill, New York 1964). 
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= {ct,-x,-y,-z} = {x o,xi,x 2 ,Xi} . (1.6) 

Similarly the indices can be “raised” to give 
X U _ g ^ x u = |a°, jc 1 , jc 2 , jc 3 } 

This means that one can easily transform the covariant into the contravariant form 
of a vector (respectively of a tensor) and vice versa. Except in special cases, where 
we denote it explicitly, we use Einstein ’s summation convention : We automatically 
add from 0 to 3 over indices occuring doubly (one upper and one lower index). So 
we have, for example, 

3 

x • X = X^Xp = = X°Xq +X l X\ +x 2 x 2 + jc 3 jc 3 


/i=0 


c 2 f 2 


2 2 2 
■x -y —z 


— c 2 t 2 — x 2 


The definition of the four-momentum vector is analogous, 

= { E /c,Px,Py,Pz} , 

and we write the scalar product in four dimensions (space-time) as 


n E\ E2 
PxP2n = — — - Pi-Pi 


(1.7) 

( 1 . 8 ) 

(1.9) 

( 1 . 10 ) 


Pi Pl - 
ot equally 

x p = x^p^ = x ^ = Et -x-p 
We identify the four-vectors by a common letter. Thus, for instance, 

0 = {00, 0i j 02i 03} 

In contrast to this we denote three-vectors by bold type as in 
a ={ 01 , 02 , 03 } . 

Often we write only the components. Hence, 
a M = {a 0 , a 1 , a 2 , a 3 } 

means a four-vector with contravariant components. Greek indices, such as p, 
always run from 0 to 3. Latin indices, as for example i, imply values from 1 to 3. 
A three-vector can thus also be written in contravariant form as 

a' = {a 1 , a 2 , a 3 } or as a i = {a u a 2 ,a i } 

in covariant form. So the four-momentum operator is therefore denoted by 




“sH 


, .. d d d d ) 

_ i ... d d _ d ., <9 


= i h 


d 


d(ct) 


-V 


( 1 . 11 ) 
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It transforms as a contravariant four- vector, so that 


P^Pn = 


Ox,, dx“ V c 1 dt 2 




d 2 ^ d 2 d 2 
~dx 2 + df- + dz 2 


( 1 . 12 ) 


This equation defines both the three-dimensional delta operator (A — V 2 ) and 
the four-dimensional d’Alembertian (□ = d 2 f(c 2 dt 2 ) — A). Finally we check the 
commutation relations of momentum and position by means of (1.11 and 1.5), 
obtaining 


[p M , x u ] = ih 


d 

3V 




dX a 

dx,j. 


= = W* = ifoT 


(1.13) 


On the right-hand side (rhs), the metric tensor appears expressing the covariant 
form of the commutation relation. 

The four-potential of the electromagnetic field is given by 


^ = {A 0 ,A} = {A 0 ,A x ,A y ,A l } = g lu 'A v 


(1.14) 


Here A 1 ' are the contravariant, and A^ — {Ao, — A x , — A y , —A z } the covariant com- 
ponents. From A M the electromagnetic field tensor follows in the well-known way: 


p n* — 


dA * 
dx v 


dA u 

dx^ 


0 E x E y E z 

-E x 0 B z -B y 

-Ey -B z 0 B x 

^z c v ~ D X u 


(1.15) 


1.2 The Klein-Gordon Equation 


From elementary quantum mechanics 2 we know the Schrodinger equation 



h 2 

2m<> 


V 2 + V(x) 


ip(x,t) 


corresponds to the nonrelativistic energy relation in operator form, 


(1.16) 


E = — + V(x) , where (1.17) 

2l7lQ 

E = ih%- , p = -ihV (1.18) 

dt 

are the operators of energy and momentum, respectively. In order to obtain a 
relativistic wave equation we start by considering free particles with the relativistic 
relation 


2 See W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Hei- 
delberg 1994). 
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E 2 

P^Pn = ~2 -P P = m%c 2 . (1.19) 

We now replace the four-momentum p M by the four-momentum operator 

r = uJ—nI°±,-±,-±\ 

dXfi \ d{ct) dx dy dz j 

= m {m’- v } = {p °-n - <>- 20 > 

Following (1.6) and (1.11), the result is in accordance with (1.18). Thus, we obtain 
the Klein-Gordon equation for free particles, 

= mlc 1 ^ . ( 1 . 21 ) 


Here mo is the rest mass of the particle and c the velocity of light in vacuum. With 
the help of (1.12) we can write (1.21) in the form 



mfic 2 

h 2 


ip = 


d 2 a 1 d 2 d 2 mkc 2 \ 

c 2 dt 2 dx 2 dy 2 dz 2 h 2 ) ^ ^ 


( 1 . 22 ) 


We can immediately verify the Lorentz covariance of the Klein— Gordon equation, as 
p^pn, is Lorentz invariant. We also recognize (1.22) as the classical wave equation 
including the mass term mkc 2 /h 2 . Free solutions are of the form 


'ip = exp ( --p^x* ) = exp 


■T (P0X° - P • X) 


= exp 


+ ^r (p-X-Et) 


(1.23) 


Indeed, insertion of (1.23) into (1.21) leads to the condition 


= m o P^Pi* exp ( -pPn** ) = mlc 2 exp ( 

E 2 

^P tl Pn = m£c 2 or — j- -p-p = mfic 2 , 
c z 

which results in 

E — ±\/m$c 2 + v 2 . 


(1.24) 


Thus, there exist solutions both for positive E — +c(mkc 2 4- p 2 ) 1 ' 2 as well as 
for negative E = — c{mlc 2 + p 2 ) 1//2 energies respectively (see Fig. 1.1). We shall 
see later that the solutions yielding negative energy are physically connected with 
antiparticles. Since antiparticles can indeed be observed in nature, we have already 
obtained an indication of the value of extending the nonrelativistic theory. 

Next we construct the four-current connected with (1.21). In analogy to our 
considerations concerning the Schrodinger equation, we expect a conservation law 
for the jp. We start from (1.22), in the form 

(PuP* - mlc 2 ) xp = 0 , 


E 

+mQ(? 

-rape 2 


Fig. i.i. Energy spectrum of 
the free Klein-Gordon equa- 
tion 
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and take the complex conjugate of this equation, i.e. 

{pup 1 * - m oC 2 ) = 0 . 

Multiplying both equations from the left, the first by ib* and the second bv ip, and 
calculating the difference of the resulting two equations yields 

V - mlc 2 ) ip - {P^ ~ m o c2 ) i’* = 0 


or 

- V (ft 2 V M + m 0 2 c 2 ) + (ft 2 + mlc 2 ) ip* = 0 

^ (V>* V^/> - V'V'V*) = = 0 . (1.25) 


The four-current density is therefore 

U = ^ • (1-26) 


Here we have multiplied by ih/lrriQ , so that the zero component y’o has the 
dimension of a probability density (that is 1/cm 3 ). Furthermore this ensures that 
we obtain the correct nonrelativistic limit [cf. (1.30-31)] below. In detail, (1.25) 
reads 


d i ft (j^chp .,dip*X 
dt [2moc 2 dt ^dtj 

+ div ( ^ ) [r(y*p) - = o . (i . 27 ) 


This expression possesses the form of a continuity equation 

dg . 

+ div 7 = 0 . 

dt 


As usual integration over the entire configuration space yields 

1 1^=1 L ^ = - L dF = o 

Hence, 


const. 


I g d 5 x = 

Jv 

i.e. f v g d 3 x is constant in time. It would be a natural guess to interpret 


(1-28) 


in 

IrtiQC 2 


'.dip ar 


( 1 . 29 ) 


as a probability density. However, there is a problem with such an interpretation: 
At a given time t both ip and dip/dt may have arbitrary values; therefore, q(x, t) 
in (1.29) may be either positive or negative. Hence, g(x, t) is not positive definite 
and thus not a probability density. The deeper reason for this is that the Klein- 
Gordon equation is of second order in time, so that we must know both tp(x, t) and 
dip(x, t)/dt for a given t. Furthermore there exist solutions for negative energy [see 
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(12.4) and (1.38) below]. This and the difficulty with the probability interpretation 
was the reason that, for a long time, the Klein-Gordon equation was regarded to be 
physically senseless. One therefore looked for a relativistic wave equation of first 
order in time with positive definite probability, which was finally derived by Dirac 
(cf. Chap. 2). However, it turns out that this equation has negative energy solutions 
too. As we have previously remarked and as we shall discuss in greater detail later, 
in Chap. 2, these solutions are connected with the existence of antiparticles. 


1.3 The Nonrelativistic Limit 

We can study the nonrelativistic limit of the Klein-Gordon equation (1.21). In order 
to do this we make the ansatz 


WV) = p(r,t)ex p [ -~m§c 2 t 

n 


(1.30) 


i.e. we split the time dependence of ip into two terms, one containing the rest mass. 
In the nonrelativistic limit the difference of total energy E of the particle and the 
rest mass m^c 2 is small. Therefore we define 

E' — E — rrioc 2 

and remark that the kinetic energy E f is nonrelativistic, which means E s <C m<\c 2 . 
Hence, 


..dip . J 

m— « E p moc p 
holds also and with (1.30) we have 


(1.31) 


dp . m$c 


— i 


dxjj 
~dt = 

d 2 p 
dt 2 dt \ dt 


dt 
d [dp 




— 1 - 


.mo<r 


^jexp^-moc-f 


.moc f l 2 

— i — - — pcxp [ —-rriQC t 


. m$c 2 dp . moc 2 dp m^c 


2^4 


h dt 
n 2 


— 1- 


,2mc\c 2 dm m 2 c 
1 h ~dt + 


h dt h 2 

2^4 1 


-p 


exp [ --moc t 


exp 


--rriQC^t 

n 


Inserting this result into (1.21) yields 


1 


. IniQC 2 dp m^c 4 
1 + 


h dt 


h 2 


( &_ cP_ 

\9a 2 + dy 2 + dz 2 


exp 

m^c 2 


--m 0 c-t 

n 


pex p 


niQC t 


.dp h 2 ( d 2 d 2 d 2 \ h 2 A 

lh dt ~ 2 mo (&c 2 + dy 2 + dz 2 ) * ~ 2m 0 V 


(1.32) 
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This is the free Schrodinger equation for spinless particles. As the type of 
particle which is described by a wave equation does not depend upon whether the 
particle is relativistic or nonrelativistic, we infer that the Klein-Gordon equation 
describes spin-zero particles. Later on we will obtain this important result in a 
totally different manner by making use of the transformation properties of the 
Klein-Gordon field ip. 


1.4 Free Spin-0 Particles 


Previously we have remarked that in a relativistic theory the concept of a free 
particle is an idealization. Furthermore spin-zero particles, like pions or kaons, 
interact strongly with other particles and fields. Nevertheless we can discover some 
of the practical methods for dealing with these problems by studying the free 
solutions of (1.21). We return to the interpretation of the current density (1.26) that 
we discarded due to g in (1.29) not being positive definite. As usual integrating 
the continuity equation (1.28) yields 

l f d3 * = § / = ~ l divi d3 * = - J/ ■ dF = 0 ■ 

which means that 

/ g d 3 x = const. , (1.33) 

Jv 


i.e. the constancy in time of the normalization (which is a reasonable result). The 
question remains of how to interpret g and j. The probability interpretation is not 
applicable, as we have just seen in context with (1.29). However, we do have the 
following alternative: We obtain \hc four-current density of charge by multiplication 
of the current density (1.26) with the elementary charge e to give 


K = 2^ = {cq', - j'} , (1.34) 

where 


i he 

ImQC 2 


' dip dip * 


signifies the charge density , and 


i eh 
2m 0 


(ip*Vip - ipVip*) 


(1.35) 


(1.36) 


denotes the charge-current density. The charge density (1.35) is allowed to be pos- 
itive, negative or zero. This equates with the existence of particles and antiparticles 
in the theory. By calculating the solutions for free particles , we may understand 
this still better. Starting from (1.22), written in the form 


- mlc 2 ) ip = 0 
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and from the ansatz (1.23) for free waves 


i' = a exp ( -jV^ 


A exp 


h 


(p * x - Et) 


we obtain the necessary condition that 


P^Pn - mlc 2 = o =p*-p 1 - rn&c 
or, because of po = E /c, 

E 2 = c 2 ( p 2 + mlc 2 ) . 


2.2 


(1.37) 


Consequently, there exist two possible solutions for a given momentum p : one 
with positive, the other with negative energy, 


e p = ±c \Jp 2 + » a±) = a±) ex p 


-(p-x^\E p \t) 


(1.38) 


A(±) are normalization constants, to be determined later. Inserting this into the 
density formula (1.35), we find 


■ 


(1-39) 


This suggests the following interpretation: specifies particles with charge -he; 

0(_) specifies particles with the same mass, but with charge —e. The general 
solution of the wave equation is always a linear combination of both types of 
functions. This point may be further clarified by discretizing the continuous plane 
waves (1.23). For that purpose we confine the waves to a large cubic box ( normal- 
ization box ) with edge length L (see Fig. 1.2) and, as usual, we demand periodic 
boundary conditions at the box walls. This yields in the well-known manner 


VVi(±) = K{±) exp 

where 


T ip n ' x T E p j) 


(1.40) 


2tt 

Pn = —n , n = {n x ,n y ,n z ) 



Hi £ N 


and 


Up, = c \Jpt + m lc 2 = E n 


Fig. 1.2. The figure shows 
a normalization box. Two 
standing waves along the x 
axis are indicated 


(1-41) 


Here n is a (discrete) vector in the lattice space with axes n x ,n y) n z . Using (1.39), 
the normalization factors A(±) are determined by the requirement that 


±e = d 3 xg(±)(x) ± 

JO 


eE , 


Pn 


m 0 c z 


|A„, 


(±)i 


I 2 T 3 


Choosing the phases in such a way that the amplitudes are real, we get 

^n(dh) ' 


! moc * 

Ue 


'Pn 


(1.42) 
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and thereby 


tyn(±) = 


moc 2 

E 5 E Pn 


exp 



x 




(1.43) 


Notice that the normalization of either type of solution (corresponding to pos- 
itive and negative charge) is the same. The only difference is due to the time 
factor exp(=bi E Pn t/h). The most general solutions of the Klein-Gordon equation 
for positive and negative spin-0 particles read then as 


A+) = = JTjT ex P 


l (.Pn • X - E P j) 


and 


V>(~) = b n^n(-) = b "\l THT eX P 

n n V ^ ^Pn 


[Pn * X + E p J) 


(1.44) 


respectively. Solutions for spin-0 particles with zero charge can be constructed too. 
One immediately recognizes from the form of the expression for the charge density 
(1.35) that the Klein-Gordon field iJj has no be real for neutral particles , in which 
case 


) 


(1.45) 


By means of (1.43) we can easily describe a wave front for a neutral particle: 

^»<0) = 71 ^" +)(Pn) + 


I mo£ 
2L 3 E, 


: Pn 


moc z 


IDE, 


Pn 


exp 


exp 


s 

8 

1 

+ exp 

^ {-P n -X + E p j) 

JPn-X- E pJ) 

+ exp 

1 

S 

8 

1 

L 1 


moc ‘.2coJ P -' X - E '- ‘ 


2L 3 E, 


' Pn 


(1-46) 


Thus holds and, therefore, according to (1.35) 


y (0) 


Q* = 


i he 

2 m 0 c 2 




d'fn 


( 0 )" 


( 0 ) 


dt 


rn(Q) 




( 0 ) 


dt 


= o 


Furthermore, we realize that the current density j\x, t) of neutral particles (1.36) 
vanishes too. Consequently in this case there is no conservation law. Obviously the 
relativistic quantum theory necessarily leads to novel degrees of freedom, that is 
so say the charge degrees of freedom of particles. In a nonrelativistic theory free, 
spinless particles can propagate freely with a well-defined momentum p. In the 
relativistic case of free, spinless particle, three solutions , which correspond to the 
electric charge (+, — ,0) of the particles, exist for every momentum p. 
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EXAMPLE 

1.1 The Charged Klein-Gordon Field 

Hitherto we have inspected the Klein-Gordon equation both for a real, i.e. un- 
charged, and for a complex, i.e. charged, scalar field, in the case of the complex 
field we specified a current 

^ - ^ v V) , (1) 

with dj ‘‘ I dx " — 0 and a charge 

Q = ( ieh/2m 0 c 2 ) j d 3 x((p*0 - pp*) . (2) 

Now we want to examine charged fields in a little bit more detail. To that end we 
decompose the complex wave function <p(x) into real and imaginary components 
as 


</>(*) = MO) + m(*)] , 

where p\(x) and <^ 2 (*) are real. If p(x ) fulfills the Klein-Gordon equation 


□ + 


2 2 

m 0 c 

h 2 


<p(x) = 0 


( 3 ) 


( 4 ) 


then it immediately follows that <p\ and p> 2 also obey the Klein-Gordon equation, 
i.e. 


□ + 


m\c 2 


<f](x) = 0 


and 


□ + 


mjc 2 

~W~ 


Vi(x) = 0 


( 5 ) 


Conversely the following holds: If two fields <f\(x) and -fiix) separately fulfill a 
Klein-Gordon equation with the same mass m = m\ — m 2 , then the equations can 
be replaced by one equation for a complex field, i.e. 


<p= i ip 2 ) 


and 


fulfill 


<P 


V2 


(ip : - i ip 2 ) 


□ + 


WIqC 2 

h 2 


(p = 0 and I □ + 


m^c 2 

~W r 




-0 


( 6 ) 

( 7 ) 


By interchanging <p and <p* in 

\ & ti r 

Q = ^^2 J d3 *<W - w>*) (8) 

we obtain the opposite charge. Hence <p and <p* characterize opposite charges. 
These studies can, for example, be applied to the pion triplet (tt + , tt _ , tt°): The tt 0 , 
being a neutral particle, is characterized by a real wave function, whereas i r + and 
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Example 1.1. 


7 r , being charged fields, have to be represented by complex wave functions. tt + 
and 7T“ have the same mass and opposite charges, i.e. we can define 


^7T + = = 

^7T- = <P = 


V2 


(<P\ - i <p 2 ) 


V2 


(if: + i <p 2 ) 


and 


(9) 


<^7r<> = <A) = <£o 


( 10 ) 


Before we start to discuss the degrees of freedom, with respect to charge in 
detail, let us consider the energy. Is the energy positive (for positive particles), 
negative (for negative particles) or even equal to zero (for neutral particles)? In 
order to answer this question, we have to discuss the energy of the Klein-Gordon 
field within the scope of the general canonical formalism. 


1.5 Energy-Momentum Tensor of the Kiein-Gordon Field 


In classical mechanics the energy is always given by the Hamilton function H 
which depends on the Lagrange function by means of the relation 


H = ^7 Tiqi -L 


( 1 . 47 ) 


The classical equations of motion are obtained from the variational principle 


■S 


6 / Ldt = 0 


which leads to the Lagrange equations in the familiar way, 3 i.e. 
d dL dL 


dt Oq, dq i 


= 0 . 


( 1 . 48 ) 


( 1 . 49 ) 


The field theory is based on a similar concept 4 where the starting point is the 
Lagrange density 


£ fp<r, 


dj>q 

dx u 


( 1 . 50 ) 


from which one obtains the Lagrange function by integration over the three- 
dimensional volume V 


3 See H. Goldstein: Classical Mechanics, 2nd ed. (Addison- Wesley, Reading, MA 1980) or 
W. Greiner: Theoretische Physik II: Mechanik II (Karri Deutsch, Frankfurt a.M. 1989). 

4 See J.D. Jackson: Classical Electrodynamics , 2nd ed. (Wiley, New York 1975). 
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L = 



d 3 x 


(1.51) 


In general, the Lagrange density C depends on the various wave fields i/j^ and all 
their derivatives dip a / dx^. Derivatives of higher order are not considered because 
they would lead to nonlocal theories. So the variational principle (1.48) reads 

8 j Ldt = 8 j j d 3 x dt 

= S I C (*” §i~) d4, = 0 <152) 

and yields in the well-known way (see the following Exercise 1.2) the field equa- 
tions 

d dC dC _ n 

dx“‘ jdx'r 1 ) dxjj a U ’ KL-JJ) 

Within the theory of wave fields (1.53) represent the equations of motion and 
are analogous to the classical equations of motion (1.49). Note the similarities 
and differences between (1.49) and (1.53): the variable i (time) distinguished in 
(1.49) is of the same significance as all other coordinates of x M in (1.53). In (1.49) 
one considers the Lagrange function L, whereas in the field equations (1.53) L is 
replaced by the Lagrange density C. 


EXERCISE 


1.2 Derivation of the Field Equations for Wavefields 


Problem. Derive the equations of motion (field equations) from the variation prin- 
ciple for the fields c) 


sf cU,^ 

Jvs V ox u 


d 4 x = 0 . 


fv 4 \ 

Solution. The variation is defined by 


-1 


dVv \ ,4 


dx v- 
dC 


d 4 x 


S'tpa + 


dC 


d-lpcr \ 


d 4 x = 0 


( 1 ) 


_di/j a ’ ' d(dxj) a / dx* 1 ) \dx^ )\ 

We make use of the possibility of interchanging variation and partial differentiation, 


s 9 *' 


dx v dx v 


(to 4 " & to) ~ n To ~ Q n (&To) 


dx v 


dx^ 


( 2 ) 


The second term in (1) is integrated by parts, taking into account that the variation 
of vanishes at the integration boundaries. This leads to 
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Exercise 1.2. 



d d£ 
dx v d^dipe/dx^) 


d 4 x 


= 0 


(3) 


As these equations are valid for arbitrary variations Sip^, one obtains the field 
equations {Euler— Lagrange equations ) 


dC _ d dC 
dip a dx^ d{dip a /dx^) 


(4) 


We will now demonstrate the procedure of the Lagrange formalism for the 
Klein-Gordon field. In this case we deal with two wave fields ip and ip* (for 
charged particles). We could also choose the real and imaginary part of ip as 
independent fields, but prefer the choice of ip and ip*. Now, the Lagrange density 
for the Klein-Gordon field is of the form 




dip dip * 
dx ^ ’ dx v 


2 m 0 \ dx' 1 dx 1 ' 


2 „2 


mXc 

~¥ 


-Ip* Ip 


(1-54) 


The constant h 2 /2mo is chosen in such a way that f C d 3 x has the dimension 
of energy, so that in particular the plane waves (1.43) carry the energy E P . [see 
(1.61) below]. The proof that this expression is the correct Lagrange density for the 
Klein-Gordon field is given by showing that, with the help of (1.53), one obtains 
the Klein-Gordon equations for ip and ip * as resulting field equations. Wc note 
that one can immediately recognize from (1.54) that C is a Lorentz scalar if ip and 
ip* are scalar fields, which we assume. Because of the covariance of (1.53), the 
resulting field equations (in our case the Klein-Gordon equation) are also Lorentz 
invariant. Indeed, by inserting (1.54) into (1.53) one immediately obtains 

9 9C _ ?£ = o = + 2^*) 

dx v d(dip/dx y ) dip 2m 0 v <9x 1 ' dx ^ h 2 ) 


or 




d_ 

dx** dx v 


2 2 

rnxc L /# 

ip + = o 

n z 


and, analogously for the ip* field, 


9 _^ = o = l— ( 1 + m ^ C f* 

dx v d(dip/dx u ) dip 2m 0 {dx" dx v h 2 


or 




_d d_ 

dx u dx v 


2 2 

i> + = 0 

h l 


(1.55) 


(1.56) 


Hence, the Lagrange density (1.54) yields precisely the Klein-Gordon equation for 
the ip field as well as for the ip* field. 


1.5 Energy-Momentum Tensor of the Klein-Gordon Field 


The energy and momentum of the wave field are described by the energy - 
momentum tensor , 5 defined as 


v = £ 


dip 0 


dC 




dx v d{dib a / dx y ) 

The energy density H(x) of the field is identical with Tj, 
H(x) = T°(x) . 

In our case one obtains from (1.54) that 

T V = dip dC ^ dip* dC _ r„ v 

^ dx^ d(dip/dx u ) ' dx^ d(dip* / dx v ) 


(1.57) 


(1.58) 


ry_ 

2m 0 


9 

\ 

and, hence 
H{x) = T° 

= 

2mo 


' ' av dipydy_ dip dip* 
dx a dx^ 1 " dx a dx^ 1 

ae 3 dip m 2 c 2 


dx e dx e 


h 


-4>*ip 9 / 


(1.59) 


' „a 0 dyyd± , „<ro dip dip* 
y y 


dx a dx° 


dx a dx° 


- 1 r**) 


2m 0 


dip* dip t dip dip 


dx° dx Q dx° dx° 
3 


dip* dip uiy uip 

0 ^ dx* dx l 


dip* dip m^c 2 


JL 

2m 0 


dx° x 0 

i = 1 

1 dip* dip 


h 2 


ip*ip 


: 2 dt dt 


+ <yip*)-(Vip) + 


mfic 2 

~W 


ip*ip 


(1.60) 


The energy H , which belongs to the plane wave solutions (1.43), is given by the 
integral, over the energy density in the volume L 3 , 


I Tg(n,: 

JO 

= f — 

JO 2m <> 


ff»(±) = / 7’o>,±)d J x 

Jo 

ty 

to 2m <> 
h 2 moc 


m 0 c 2 (tE p „) 2 m 0 c 2 p n -p n mfic 2 m 0 c 2 

' T '1 T- Z-O ”1” 


OE Pn k 2 c 2 l}E Pn h 2 


h 2 l 2 e, 


Pn 


d 3 x 


2 mo L 2 E P h 2 


2 rp 2 

“t +P 2 n +m 2 c 2 


h 2 


m 0 c 


2 2 E, 


2m 0 E i E Pn h 2 c 2 


— L 3 = 


'Pn 


(1.61) 


See Example 1.3 and, for a detailed discussion, J.D. Jackson: Classical Electrodynam- 
ics , 2nd ed. (Wiley, New York 1975) or W. Greiner: Theoretische Physik HI, Klassische 
Elektrodynamik (Harri Deutsch, Frankfurt a.M. 1985). 



16 


1. Relativistic Wave Equation for Spin-0 Particles 


Although we chose the constant h 2 /2rriQ of the Lagrange density (1.54) in such a 
way that the wave carried the energy -\-E Pn , the result (1.61) shows that the 
wave ipn(-) also carries energy +E Pn . Thus, we have found the interesting result 
that plane waves ip n (±) f rom (143) describe particles with positive and negative 
charge, respectively [see (1.39) and (1.42)], but that both waves carry positive 
energy +E Pn = +c(p 2 + m^c 2 ) 1 ^ 2 . Hence, energy plays two roles: On one hand 
-f -E P characterizes particles with positive charge according to (1.39) - the plane 
wave in this case reading as ^(+) ~ exp[i(p • x - E p t)/h] - and —E P , particles of 
negative charge - the plane wave in this case reads as ip(-) ~ exp[i (p • x + E p t)/h]. 
On the other hand \E p \ always gives the energy of the particles. 


EXAMPLE 


1.3 Determination of the Energy-Momentum Tensor 
for a General Lagrange Density £(0 C r , dxp^/dx^) 


In order to calculate the energy-momentum tensor we start with the Noether theo- 
rem: Each continuous symmetry transformation which leaves the Lagrange density 
invariant, corresponds to a conservation law and, hence, to a constant of motion. 
Specifically we shall discuss the conservation laws, which follow from the trans- 
lational invariance of classical field theory. 

Consider the infinitesimal displacement 


: X u + £ u 


Sx n = x l - H = 


The corresponding variation of the Lagrangian is given by 


( 1 ) 


6C = 


dC c dC 

OXn — £fi 

OX n OXn 


( 2 ) 


!f, on the other hand, C is translationally invariant then £ does not explicitly depend 
on the coordinates. Thus, we can write £ = C(xp a ,dxp (7 /dx IJ/ ). From this follows 
the variation 


a 


' dC f ; dC 

' d(dtp IJ /dx IL f 


dx,j 


(3) 


The variation of xjj a is obtained as 


f _ ( drp a (x) £ _ dlp a __ 

Vyc t — 0 OXy — ^ 


<Z,y 

(jXy UXy 

In the following we will also use the Euler-Lagrange equations (field equations) 


(4) 


dC d dC 

dip a dxi 1 d(chp a / dx^) 


( 5 ) 
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Equating (2) and (3) yields 


dC 


KJ As \ 


dC 


Slpa + 


dC 


-E 


9 


dip a d(dtp a /dx,j,) 

dC j 9t/>, 


dip* 

dx,. 


[ [ dx t* didder /dx^) 
where we have used 


+ 


dC 


d ( dtp, 


dx u d(dtp c / dx fL ) dx v \ dx 


dtpo 

dx,. 


d ( dtp a 


dx„ \ dx 


ftXv — 


d ( dtp a 


dx,, \ dx, 




( 6 ) 


(7) 


The rhs of (6) can be expressed as 


dC 


d 


' dx^ dXf, 


E 


<9£ 


dtp a 


d{dtp a /dx^) v dx v 

Since the result holds for arbitrary translations s )L we can write 

d 

^-t^ = o . 

dx,. 


( 8 ) 


(9) 


These are four continuity equations (one for every v) and, thus, four conservation 
laws. 1),,, is the energy-momentum tensor (stress tensor), given by 


7/ti/ — -f- y ^ 


dC 


dtp a 


d{dtp a /dx tl ) dx v 


( 10 ) 


For a better understanding of its physical content we will illustrate the meaning of 
the Too component. For that reason, in analogy to classical mechanics, where the 
generalized momentum is given by p n = dL/dq„, we define the momentum-density 
conjugate to tp a (x,t) by 


Tt<r(X,t) = 


dC(tpq,dtpa/dXy) 

dlp 0 (X,t) 


(ID 


Here the short-hand notation tp a = dtp a /d(ct) has been used. The classical Hamil- 
tonian reads 


H(p,q)=pq ~U.q,q) 


( 12 ) 


Analogously, we express the Hamiltonian as a volume integral over a Hamiltonian 
density TL(it, tp) with the definition 


d 3 x H(n(x,t), tp(x,t)) , 


(13) 


and 


H — ^7 T a tpa - £ . 


(14) 


Example 1.5. 
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Example 13. 


Using (14) and (11), we find that (10) yields 


d 3 x T Qu — j d 3 x 

with dp u /dt = 0. The quantities p u are therefore constant in time, which follows 
from (9), because f d 3 jc dT<\ v jcdt — — f d 3 xdT iLJ /dxi — 0 according to GauB’s 
theorem. For the Too component of the stress tensor it clearly follows that 



E 


ft a 


dipo 

dx v 


— 9oi/£ 


(15) 


Too = Ej ’’’frVv - C = 7i 

a 


(16) 


and, therefore, from (13) 


/ 


d 3 x Too 


= H 


(17) 


Hence, we can identify p v with the energy-momentum four- vector. Its time com- 
ponent is clearly the energy, and also each component of p v is conserved in time, 
i.e. it is a constant of motion. 


EXERCISE 


i.4 Lagrange Density and Energy-Momentum Tensor 
of the Schrodinger Equation 


Problem. Derive the Lagrangian for the Schrodinger equation and discuss the 
corresponding energy-momentum tensor. 


Solution. The Lagrangian is given by 


C 


2mo 


(WO • (Vtf o 


h 

2i 




* dip dtp* 


dt dt 


ip - xp*Vip 


( 1 ) 


Using the Euler-Lagrange differential equation we will show that the variation 
of this Lagrangian yields the Schrodinger equation, ip and ip* have to be varied 
independently. 

The Euler-Lagrange equation, split up into space and time components, reads 


dC _ d dC _ d d£ 
dip a dx' d(dip a /dx l ) dt dip a 


( 2 ) 


where the summation over i runs through / — 1, 2, 3 or 

d_dL_ _ dC_ dC 

dt dip a dip a V d(Vip a ) 


( 3 ) 


First we vary with respect to ip* and obtain 


-Vip + ^—V 2 ip - "ip = 0 
2mo l 


( 4 ) 
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Analogously, variation with respect to 'ip yields 

-vr + ^-vy + -f = o . (5 ) 

2 m 0 l v ’ 

Rewriting (4) and (5) yields the Schrodinger equation for ip and ip* in the familiar 
form, 

h 2 _ 

i hip = -~ — V 2 ip + Vip = Hip , 

2 m 0 


H 2 

-Ihip* = -- — V 2 ip* + Vip* = H^ip* , 
2 m 0 


( 6 ) 


where 


h 2 


H = - — A + V(x) 

2m 0 

is the Hamiltonian. The conjugate momenta to ip are 
8C i h 


7 T = r = —'lb* , 

dr/) 2 ^ 7 


and 


7 T = 






4 > • 


d'ip * 2 

Hence, the Hamiltonian density reads 

H = - C= - — VwVip- %Vtv<P . 

m 0 h 

We also determine the stress tensor belonging to the Lagrange density (1), 

T v = dip DC dip* dC _ 

* dx» d(dip/dx v ) ' dx» didip* jdx v ) ' 

In particular the 7 q u component, split up into space and time components, is 

= -%■&!>* + P%^P + • ViP + _ ip*^) + ^*Vtp 

zi L\ Zmr\ zi 


( 7 ) 


( 8 ) 


( 9 ) 


( 10 ) 


h 2 
2 m 0 


Vip* - vip + ip*vip = n , 


(ii) 


that is, in accordance with (9). 7o° gives the energy density of a given system. The 
total energy of the Schrodinger field is then 

H= I T 0 °d 3 x = I (^-Vip* -ViP + ip*Vip] d 3 * 

= J ip* ^-’V 2 + ip d 3 x = j" ip*Hipd 3 x , 
i.e. the expectation value of the Hamiltonian. 


Exercise 1.4. 
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Exercise 1.4. 


Calculation of the energy flux S : In analogy to the roynting vector in electro- 
dynamics, S = E x J3, the canonical formulation generally follows as 

S = e\T(f + £2 To 2 + 3 j (12) 


where e, are the Cartesian unit vectors. In the case of the Schrodinger field we get 


S 




ac 


dyS/'ly) 






2m 0 


(13) 


Calculation of the momentum density p : 
P = ^1^1° + £2^2° + ^3^3° 


Ci s* 

= (vvo^f + W) 

dip 


dC 

dip* 


= -^(rvip-ipvr) 

- 2i 


(14) 


This is the expression known to us from elementary quantum mechanics. 6 The 
remaining components of the tensor are called momentum fluxes. 


EXERCISE 


1.5 Lorentz Invariance of the Klein-Gordon Equation 


Problem. Show the Lorentz invariance of the Klein-Gordon equation. 


Solution. The Lorentz invariance of the Klein-Gordon equation is a direct con- 
sequence of the invariance of the underlying energy-momentum relation. In four- 
vector notation it reads 

3 

'Y^PuP' 1, = PnP >t = E 2 /c 2 -p-p = m%c 2 ■ 
u=o 


Now it is helpful to express the Klein-Gordon equation in four-vector notation, i.e. 


-ft 2 0 = (~h 2 c 2 V 2 + mic 4 ) 


dt 2 
changes to 


□ + 


m 


— 0 , where 


□ = 


_d d_ 

dx^ u 


6 See W. Cireiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Hei- 
delberg 1994). 
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Hence, in the transformed system the Klein-Gordon equation should read 


j9 d_ 

dx[. dx'v 

H' 


|a Z 


vXx') — 0 with K — 


m$c 


thus, it has to be shown that the operator ( d/dx^)(d/dx is invariant under Lorentz 
transformations. This may easily be seen, because 


p n = +i h 


d 
dx v 


and, therefore, 


_a d_ 

dx^ 




The length of the vector p M is, as stated before, Lorentz invariant 
the operator □ is too. 

The transformation properties of the wave function can easily 
looking at the plane-wave solutions of the Klein-Gordon equation, 


and, therefore, 
be deduced by 


y\ x ll) = z lkfjXH , with ko = — = Vk l + x? . 

c 

The difference between t/j'ix' ) and ip'ix^) should be noticed: and 7//(xp refer 

to the same space-time point, i.e., x^ — Ylt=o where is a Lorentz trans- 

formation, whereas ^(xp and ^(Xy) refer to two different points with coordinates 
Xp in the old and new system, respectively. 


Exercise 7.5. 


L6 The Klein-Gordon Equation in Schrodinger Form 

To demonstrate the new degrees of freedom of the charge in a more distinct way, 
it is advantageous to transform the Klein-Gordon equation (1.22) - which is of 
second order in the time coordinate - into a system of two coupled differential 
equations that are of first order in time. This is achieved by the ansatz 

dijj 

i> = <f + x , ‘^^7 = m oc 2 (v? - x) , C 1 - 62 ) 

in which ijj and the time derivative d'lp/dt are expressed by the two functions ip 
and x- According to (1.22), the Klein-Gordon field fulfills 

1 d 2 , f d 2 d 2 d 2 m 2 c 2 \ , 

c 2 dt lV ~ \dx 2 + fy 2 + Ih 2 ~ ^ ‘ 

It is easily proved that the two coupled differential equations, 
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are equivalent to the Klein-Gordon equation (1.22). By adding and subtracting the 
equations (1.63), the following arguments can be made: 

(a) addition yields 

d 2 

+ X) = m o c (<P - X) ■ 

This is the second equation of (L62); it leads to the trivial equation drh/dt = 
drb/dt, 

(b) subtraction yields 

d ft 2 

-x)= - X) + rn 0 c (cp + x) , 

at mo 

or, using (1.62), 
d ( \ h 


i h 


dt \m 0 c 2 dt 
H 2 1 d 2 rp 
mo c 2 dt 1 
1 d 2 rjj 


- 


h l 2 

= Arp + m 0 c rp 

m Q 

h 2 . , , , 

= Zlrp + moc^rp 

m 0 


c 2 dt 2 


= Axj) — 


TTIqC 2 

nr 




which is just the Klein-Gordon equation (1.22). 

The coupled equations (1.63) may be combined to form one equation. For this 
purpose we introduce the column vector 


V = 


(1.64) 


and make use of the four 2x2 matrices 

0 1\ „ / 0 -i' 

1 0 I ’ T2 _ I i 0 


T3 = 


11 = 


1 0 
0 1 


(1.65) 


These are identical to the Pauli matrices, 7 with the significant difference that 
the matrices (1.65) do not act in spin space, but in the vector space defined by 
(1.64). The Pauli matrices fulfill the algebraic relations 


t 2 = 1 


f k fi = -fif k = i f m {k.l.m = 1, 2, 3 - cyclic} 


( 1 = 66 ) 


Using (1.64-66) we can combine the coupled equations (1.63) to form a Schrodin- 
ger-type equation, namely, 


ih— & = Hf& 
at 


or 


i/4 -ft!* 


(1.67) 


7 These were introduced in W. Greiner: Quantum Mechanics - An introduction , 3rd ea. 
(Springer, Berlin, Heidelberg 1994). 
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where the Hamiltonian He for free particles is given by 


P 2 


H f — (f 3 + tr 2 ) r f 3moC 2 

jLTYIq 


1 l\ p , (\ 0\ 2 /i .ox 

■i -1)2^ Ho -1 1 ""' - < L68) 


Hence, in (1.67) we have found a Schrodinger formulation of the Klein-Gordon 
equation. Starting from (1.67) and using the relation 


Hf = c 2 p 2 + mfjc 4 


d-69) 


it is easily proved that each component of the vector & of (1.64) individually 
satisfies the Klein-Gordon equation. This is most elegantly shown by applying 
(i djdt + Hf) to the Ins of (1.67), yielding 

“£ + *■)(“£- 4 * =° 


= 0 


+ h 2c2 A - m%c 4 ) V = 0 


or 


Id 2 A m^c 2 


dt 2 


h 2 


<P = 0 


(1.70) 


In this representation the expression for the density (1.35) becomes especially 
simple. With (1.62) we find 

, ieh ( dip dip* 

e = 2n^V' a?- 4 " -W 


emnc 


{nv -x)+ mv* - x*)) 


2moc 2 

= | [(<H + x*)(v -x) + (v + x)<H* - X*)] 

= \ bV - X*X - + W* - XX* 

= e {<p*<p - x*x) 

= e&f& . (1.71) 

Similarly, from (1.36) we can infer the current vector in Schrodinger representation 
./ eh 


3 = 


2mQ\ 


7 [^T3(f3 + if'2)Vlf r — (W 1 )^ + irffP] 


(1.72) 


For the normalization of charge it follows that 

/ f ' w d ,i=± ' ■ ° r 

[&t 3 ¥d 3 x = ±1 = f (w? - XX*)d 3x • 


(1.73) 
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Let us once again consider free particles in this new representation. If we write 


& = ( ^ ) = A[ J exp 

<XJ Vxo. 


h 


(p • x - Et) 


and substitute this ansatz into (1.67) by means of (1.68), we find 


or, alternatively, 


1 + 

-1 -l ) 2 m 0 \x) V 0 


0 \ 2 (V 

-1 moC u 


P 2 , .A , ... 2 P 2 


(1.74) 


(1.75) 


(1.76) 


Vo 


Ev = + X) + m 0 c 2 <p , Ex = -~(v + X)-m 0 c 2 y 

ZfflQ xLTYIq 

and xo are therefore readily determined by the solution of the coupled equations 

D 2 o\ D 2 

£ “2^“” oC V'’ 0_ ^ X0 = ° ’ 


h:, ) ,p,> ~{ t: + : kv ~ o ■ 

and, since the determinant necessarily needs to vanish, 


(1-77) 


r P 2 

E moc 

2 mo 

+ f 

2 m 0 
It follows that 


2mo 

P 2 7 

E + b m 0 c 2 

2 m 0 


= 0 


E 1 - = 0 

\2m 0 J \2m 0 ) 


We thus recover the relativistic energy-momentum relation 

E 2 — p 2 c 2 + iuqC 4 or E = ±c p 2 + m^c 2 = ±E P , 

the corresponding solutions following from (1.77) and (1.74). We shall discuss the 
positive and negative energy solutions separately: 

(1) E = +E p : 


V«\p) = A (+) Q +) J exp [i(p • x - E p t)/h] = 
where 


Y?0 +)> \ _ f rn 0° 2 + Ep\ 

•Xo +) J - Woe 2 - E P J 


(1.78a) 


(1.78b) 


and 
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a+) = 


V'V? y/ 4 m 0 c 2 


(1.78c) 


Equations (1.78) are readily understood if we refer to (1.76). from which we can 
deduce that 

(E p - m 0 c 2 ) v4 +) = - (E p + m 0 c 2 ) X ^ +) or ^ = m ° C l± t E^ 


rriQC 2 — Er 


-Xo 


Choosing 

Xo +) = m o c 2 - E p , 

it consequently follows that 
<^o +) = m oc 2 + Ep . 

Equation (1.73) allows us to calculate the normalization constant A( +) from 

\ A (+)\ 2 J (^o +) Vo _) - Xo +) Vo +) ) <?x 

= |A (+) | 2 L 3 \(moc 2 + Ep) 2 -(m 0 c 2 -EpY 


= 1 


11 we chose the phase to be real, the result is 
1 1 


^(+) - 


v / L 3 (4m 0 c 2 Ep) \] Am^c 2 yj E?E p 

We proceed similarly in the other case. 

(2) E — -Ep -. 

^-\p) = A ( _) exp [i(p • x + E p t)/h] = 


where 


X^ 


m 0 c 2 -E p 

2 , J 1 and 
m 0 c 1 +E Pj 


A (~) = ^(+) 


1 


1 


1 


y/4m <i c 2 L' } Ep yj 4 m 0 c 2 y/L?E p 

In the nonrelativistic limit we obtain 


(1.79a) 

(1.79b) 

(1.79c) 


E p = cyjp 2 + m%c 2 = m 0 c 2 4 / 1 + 


pL 


l 2 C 2 


moc 2 ( 1 + 1 ) = m 0 c 2 + P 

2 miqC 1 


m^c A 

2 

2mo 


(1.80) 


and, hence, 
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f A (+)<Po‘ ) \ 


f ( m 0 c 2 + Ep)/ ^ E p 4m 0 c 2 '\ 


Va (+) Xo + V VU y.(m 0 c 2 - E p )/ yj E p 4m 0 c 2 J 


1 / 2m 0 c 2 /2m 0 c 2 \ 

\[T? \[-p 2 /(2m 0 )\/2m 0 c 2 ) 


(1.81) 


{A ( -)<p ( 0 ^ _ 1 ( ( mpc 2 - E p )/ y/E p 4m 0 c 2 \ 

Va ( _)X^ ) 7 “ y/U \(m 0 c 2 + E p )/ y / Ep4m 0 c 1 J 


->=(-* (?n + 

VL 3 \ i ) y/c^o VL 3 \i/ 


(1.82) 


Thus, we can see that in the nonrelativistic limit for states with positive charge, 
the upper component is large and the lower one is small and vice versa for states 
with negative charge. 


1.7 Charge Conjugation 


By comparing (1.78a) with (1.79a), we may write down the relatior 

Vx l T-p)/ \<p k±) *(+p)J 

= (x^(+P)\ = w 

W + H+p)J 1 ^ p; 

This can be interpreted in the following way: If the state 


(1.83) 


(1.84) 


reoresents a oositive charge, then the state 


C'PC 1 = = T\$* = 


(1.85) 


describes a particle with negative charge. We call l P c the charge-conjugated state 
of 'E. Similarly, E is the charge-conjugated state of l P c because it obeys 


(tfOc = fi(n^) =* • 


( 1 . 86 ) 


explicitly, the charge conjugation implies the following transtormations according 
to (1.83): 


(1.87) 
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If we (arbitrarily) call the particle described by as the particle , then we call 
the particle described by antiparticles. If we call, for example, the 7r~ mesons 
particles, then the 7 r + mesons are antiparticles. Neutral particles fit into this picture 
too, in that for these the charge-conjugated state is the state itself. In other words, 
neutral particles are their own antiparticles. So we have 


& c = Ti&*=a& . ( 1 . 88 ) 

The factor a has to be real. This important point can be understood if we imagine 
that, for a neutral particle, the Klein-Gordon wave function = ip + x (1.62) has 
to be real; therefore, 

Imip = — Im x (1.89) 

must always hold. Since 0 C in (1.88) is describing neutral particles, then similarly, 
Im (aip) = - Im(ax) (1. 90) 

must hold: Both conditions, (1.89) and (1.90), necessarily lead to 
a real 

This can also be deduced from (1.88), where for neutral particles both ^ and 
are real. From 

(*c)c = & , 

it follows that 

(a£') c = t\ (a!?')* af\w* — cxoCi' — w , 

so that 

a 2 = 1 , a = ±l . (1.91) 

Accordingly there exist two different kinds of neutral particles, namely 

(a) neutral particles with positive charge parity, i.e. a = +1 

!^ c = fi!P* = iP (or <p* = x) ; (1.92) 

(b) neutral particles with negative charge parity, i.e. a = — 1 


!P C — T\^* — -!P (or (p* = — x) 


(1.93) 
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EXAMPLE 

i.6 C Parity 

C parity stands for charge-conjugation parity or, less accurately, particle-antiparti- 
cle conjugation symmetry. To obtain a general definition, we choose to characterize 
a particle state in the following way: 

W) = B,Q,L,Nn) . ( 1 ) 

This characterization is based on a set of quantum numbers with the following 
definitions: 

M = mass = energy of the system, 
p — momentum of the sy stem, 

J — angular momentum quantum number, 

A = helicity = eigenvalue of the helicity operator, 

(A characterizes the spin projection onto the momentum direction), 

B = baryon number, 

Q = charge, 

L = lepton number, 

N„ = muon number, compiled in the following table. 

Equivalently we can use the hypercharge Y, the strangeness 5 and the isospin T 3 , 
which are connected via 

g = iy + r 3 , b = y-s 

to the charge and baryon number . 8 

Value of N M for different particles 

Particle e + , e _ , All other 

particles 

N n 0 + 1_10 


The following processes are forbidden due to muon number conservation: 

7T“ — > pT + Pe \ 

+ P e+ + n / 

On the other hand reactions like 

+ p — > e+ + n 1 
^ + p * p' + n j 

are allowed. 

8 See W. Greiner, B. Miiiier: Quantum Mechanics - Symmetries , 2nd ea. (Springer, Berlin, 
Heidelberg 1994). 
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The muonic charge Nf and the lepton number L may be replaced by the alterna- 
tive lepton numbers L e and L^, which are listed, for convenience, in the following 
table. 


Value of L e and L M for different particles 


Particle e + , z/ e 

e , 


All other 
particles 

Le 

-1 

+i 

0 

0 

0 


0 

0 

-1 

+1 

0 


The introduction of L e and L^ has the advantage that the set of quantum numbers 
is symmetric with respect to electrons and muons. Clearly one has 

L = Le + L„ (2) 

and state (1) can be written as 

|i/>) = \M, p, J, A; B, Q, L e , L„> . (3) 

The charge conjugation is now defined by the equation 
C\M, p, J , A; B, Q, L e , L^) 

= \M,p,J,X,B,Q, L e , L,j ,) c 

= rj c \M,p, J, X\-B,-Q,-L e ,-L^) , (4) 

which is simply saying that the C operator reflects (i.e. changes the sign of) charge- 
like quantum numbers, whereas other properties such as M, p, 7, A remain un- 
changed. The former quantities are called intrinsic , while the latter are named 
external. The state \M , p, 7, A; B, Q, L e , L M ) is called a particle state , the state 
|Af , p, 7, A; 5, — Q, — L e , — L u ) is called an antiparticle state. Since C\) — r ip c 
should be normalized, e.g. 

{A\Vc) = i , 

and because the states (3) are also normalized, it follows that 

|77 c | 2 = 1 or ? 7 C = e la , a real . (5) 

This formal property does not yet guarantee that the operator C corresponds to 
a physical symmetry. For this, the states on the rhs of (4) have to be physically 
realized, which is the case for C in nearly all known theories; however, an important 
exception occurs for neutrinos and antineutrinos. 

Although neutrinos, in particular, are of great importance in the theory of 
weak interactions, the C conjugation cannot be chosen in this case as a sym- 
metry operation. 9 On the other hand careful investigations have shown that the 
Hamiltonians of the strong and the electromagnetic interactions have a vanishing 
commutator with C, i.e. 

9 This is discussed in more detail in W. Greiner, B. Miiiier: Gauge Theory of Weak Inter- 
actions , 2nd ed. (Springer, Berlin, Heidelberg 1996). 


Example 1.6. 
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Example 1.6. 


C //strong — //strong C , 

C //elm — //elmC . (6) 

Consequently, C is, as a matter of fact, a symmetry operation for all strong and 
electromagnetic processes. 

Let us now return to (4). If one of the charge-like quantum numbers is not 
equal to zero, then p c has no physical meaning. Since, in this case, one is free to 
choose a relative phase factor between the states \M , p, J, A; /?, Q, L e , L^) and 
| M, p, ,/, A; — /?, -Q, — Lg, —Lfj) (which are assumed to be different), then we 
can choose r] c = 1. But if particles and antiparticles are identical , then (4) turns to 
be an eigenvalue equation 

| M, p , 7, A; 0,0, 0,0) c = 7ir \M , p, 7, A; 0,0, 0,0) . (7) 

The eigenvalue r7 c is named C parity . It is natural to postulate that the double 
application of the charge conjugation C leads back to the original state. Thus, 

(I M, p, J, A;0,0,0,0) c ) c = | M,p, J, A; 0,0, 0,0) 

= »7c! m . A; 0,0, 0,0) . (8) 


Therefore, 

r)l = 1 or r? c = ±1 . 

If one considers a many-particle state, e.g. a two-particle state like 

— |01 ) 102 ) , 

where each particle state \<j>j) has the C parity rf^\ then it holds that 
C\4>\,4>2) = »7c|01 , 02) = 101 )c 101 )c 

= ^7c 1) I0i> 7 ?c 2) I02) = 7 ?c 1>? ?c 2) I0i)I02) 

= Vc ) V? ) \4>1,<)>2) ■ 


(9) 

( 10 ) 


( 11 ) 


thus, 


Vc = r]c J Vr • ( 12 ) 

Since, from (6), C should be a symmetry operation, then C parity is a conserved 
quantum number for all strong and electromagnetic reactions. To illustrate this we 
consider the following three neutral particles: 

(a) The i) meson, for which m v c 2 = 548.8 MeV, = h/r^ = 2.3 keV [1 MeV = 
(3/2) x 10 31 s~ 1 ], charge = 0, spin/parity = 0 + ; 

(b) the pi on i r°, with m n oc 2 = 134.97 MeV, o = h/r n o = 7.9 eV, charge = 0, 
spin/parity = 0“ ; 

(c) the photon, having m n c 2 = 0, F 1 = h/r y = 0, charge = 0, spir7parity = . 

For these particles, the decay processes 

V -> 7 + 7 , 
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may be observed, but the following reactions do not occur: 

// ~r 7T° + 7 

V T 4 37 

7T° 3j . (14) 

We understand both the allowed as well as the forbidden processes by assuming 
the conservation of C parity and assigning the C parities to the particles as shown 
in the accompanying table. 


Particle 7 ttq 77 

C parity —1 1 -fl 


Of course, within a given process the C parity is only conserved if there occur 
solely particles with a well-defined C parity. This is true for reactions (13) and 
(14). The negative C parity of the photon , as shown in the table, can be understood 
more precisely: The photons are coupled to all other particles by the electric current 
jn . The interaction part of the Lagrangian is 

£ eIm = L(*)A^(*) (15) 

where A m (jc) is the four-potential of the photon. Obviously, the current j^{x) 
changes its sign under C transformation, 

Cj»(x)C-' = -; M (x) . (16) 

Therefore, 

CA^(x)C~ { = -A>\x) (17) 

must hold to save the invariance of the Lagrangian (15) under C transformation, 
i.e. 


C£elmC-' =r e lm • (18) 

The positive C parities of 7r° and 77 follow from the existence of the decays (13) 
using (12). 


1.8 Free Spin-0 Particles 

in the Feshbach-Villars Representation 10 

We saw in (1.81) and (1.82) that positively charged particles possess a large upper 
component in the nonrelativistic limit (|^ <+) | |x ( 7) while negatively charged 

particles have a large lower component (|x (-) | » |<^ (_) |). Now there exists a 

111 H. Feshbach, F. Villars: Rev. Mod. Phys. 30, 24 (1958). 
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representation - the so-called ( P representation - in which the positive and negative 
solutions are always of the following form 


<P (+> (p) 


<P { \p) 


This representation is established by the transformation 

where U is the operator 

( me 2 + E p ) - fi(moc 2 - E„) 


U = l 


y/4m 0 c 2 E p 


(1.94) 


(1.95) 


(1.96) 


with E p — c(p 2 + m 0 2 c 2 )'^ and $ denotes the Hermitian conjugate of 4>\ similarly 
t/1. The 2x2 matrix U is not unitary in the usual sense (t/ _1 ^ f/t) since 


V~ l = f 3 t/f 3 = 1- 


(m 0 c 2 + E p ) + n(m 0 c 2 - E p ) 


(1.97) 


y/4m 0 c 2 Ep 

This can be seen directly with the aid of (1.66) and by the relation, 

UU~\ = l - + E pf ~ ( m o g2 ~ e p) 2 = D 4 m 0 c 2 E p = 

4nioc 2 E p 4moc 2 E p 

Besides this, according to (1.78a) and (1.79a), plane waves are transformed by 

<(> {+ \p) = U¥ + \p) 


(1-98) 


= U 


1 


1 


m 0 c 2 + E p 
y/D y/4m 0 c 2 E p \m 0 c 2 - E p 

( { m 0 c 2 + E p f 

1 (m 0 c 2 + E p ) ( m 0 c 2 - E p ) 


exp[i(p • x - E p t)/h] 


0 me 2 - Ep) 2 
{.me 2 - Ep) ( m 0 c 2 + Ep) 


y/D 4 m 0 c 2 E p 

x exp[i(p • x — E p t)/h ] 

= ~j= exp[i(p • x - E p t)/h ] . 


(1.99) 


Similarly, we get 

4> { ~ } (p) = u^~\p) 
l l 


u 


rao<r 


\/T} v /4m 0 c 2 E p \m 0 c 2 + E p 

{me 2 + E p ){m 0 c 2 - E p ) 
{m 0 c 2 + E p ) 2 


r lexpfi {p-x + E p t)/h] 


{me 2 + Ep){m 0 c 2 - Ep) 
{m 0 c 2 + E P ) 2 


y/D 4moc 2 E p 

x exp[i(p • x + E p t)/h] 

(JJ exp[i(p • x + E p t)/h] . 


(1.100) 
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This is, in fact, the result required by (1.94). The normalization of the represen- 
tation follows from that of & (1.73): 


±1 = J ^f 3 Wd 3 x = J(U~ l ^f 3 U-^d 3 x 

j $\0-'j'T 3 U- x $d 3 x = J& T 3 $d 3 x 

since, due to (1.97), f 3 U~ x = Ur 3 and therefore 
(TJ- l ^r 3 TJ- 1 = (TJ- x )UJt 3 = U~ X Ur 3 = r 3 . 


( 1 . 101 ) 


( 1 . 102 ) 


Here we have made use of (U x )i = U 1 which also follows from (1.97). In 
analogy to (1.101) we define a generalized scalar product or $ product 


(#1#% =f f ^t 3 & d 3 x . 

One recognizes immediately that, as in (1.101), 


(1.103) 


(1.104) 


i.e. the generalized scalar product is invariant under the transformation (1.95). It 
seems natural to call an operator A, with the property 

W)* = (A<£|A<£% , (1.105) 


$ unitary . Such an operator has to fulfill the condition 


A h = t 3 A} t 3 = A 1 , (1.106) 

since f \P^r 3 \P f d 3 x = f ^'A^Av^d 3 * and, thus, A^f 3 A = f 3 or t 3 A*t 3 = A -1 . 
The U operator (1.96) is a member of this class. If A and f 3 commute, then the 
relation A' = A -1 follows from (1.106), i.e. the usual unitary relation. 

The charge Q of a state is given by the integral 


Q 



(1.107) 


In the following exercise we will show that the average energy of a state & is 
determined by 


E = 


[^f 3 H { ^d 3 x 


(1.108) 
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EXERCISE 


1.7 Lagrange Density and Energy-Momentum lensor of the 

Free Klein-Gordon Equation in the Feshbach-Villars Representation 


Problem. Determine the Lagrange density of the free Klein-Gordon equation in the 
Schrodinger representation (Feshbach-Villars representation). Subsequently calcu- 
late the energy-momentum tensor and show that the energy is given by the expres- 
sion E = f ^T 3 H { &d 3 x. 


Solution. In the Schrodinger representation the equation of motion for the Klein- 
Gordon field reads 


i hdt'w = Hfw with 
rr 

Hf = (T 3 + if 2 )-r hw 2 . (1) 

2m 0 

The vector w has two components, w = (^) => i f and we define 
W = To prove that the Lagrange density 

— h 2 - - 

C — imd t V — — — Vl? 7 (r 3 + 1 T 2 ) Vf 7 — moc 2 \IfT 3 \P (2) 


yields the correct equation of motion, we vary the action integral / — f C d 4 x 
using the standard method. This variation with respect to the components of f' 
results in equation (1), 


dl 

W a 


=> dy 


dC 


-«*= 0 


d{d„y a ) d$ c 

_ d£ „ dC dC n 

d t — ^ — h V • = =r— = 0 

d$ a d(V$ a ) dV a 

fr 

=> ~ 2 ^ + if 2 ) V 2 ^ a - (i hd t & a - m 0 c 2 f^ a ) = 0 


with a = 1,2. 

Similarly, variation of results in the corresponding equation of motion for 
i.e. 


dl 


= 0=> 


d, 


dC 

W a 


+ v- 


=4- — i hd,W a = 


dC dC n 
dCV&a) d& a 

V 2 (P(f 3 + if 2 )) 


OL 


, (a =1,2) 

+ m 0 c 2 (^f 3 ) a 


If we had defined the Lagrange density with S'* instead of the same equation of 
motion (1) would have resulted from the variation with respect to However, 
we demand the action / = j d 3 x dt C to be real which results in the condition that 
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/ 


&(+ihd,)$ - W hL{J l + -- T - 2 -- • V& — rn^c^r^ 


2niQ 


d J x d t 


(partial integration) 


/ 


— ( ft 2 V 2 \ 

S' | i hd, + (f 3 + if 2 ) - moc 2 T2 ) S' 


drx d t 


must be real. This is the case if each of the operators 

n 2 v 2 


ihd, 


2m 0 


(f 3 + if 2 ) + m 0 c z f 3 = Hf 


fulfills the generalized hermiticity condition 


O* 


r 3 d^f 3 = O 


This has already been proven [see (1.109) and (1.110)] to be true. The integral I 
would not be real if, instead of S', the spinor S' + had been used. 

We now calculate the energy-momentum tensor from C, i.e. 


dC 

^ ~ d{d^) 




dC 


d{dj>) 


=-d„V-C 9l 


U,v 1 


to obtain 


h 2 


Tqq = ih\Pd t \t r — ih\Pd t \P 4- — — - 4- iri) • Vtf' 4- rrioc^f^ 

2 mo 


( 3 ) 


( 4 ) 


and 


E = J T 0 0 d 3 x = J Tood 3 

-/ 


(— 

\2mo 


• (t 3 4- if2) 4- m 0 c 2 ^f 3 ^ ) d 3 x 

h 2 


\Tr j _ -41— (f 3 -j- if 2 ) V 2 4- tfiQC 2 T 3 J f' d 3 X 

2mo 


— j ' a 3 jc 


( 5 ) 


which is just (1.108). 


From (1.107) and (1.108) we can now guess the generalization of the expecta- 
tion values of arbitrary operators L and define the mean value (expectation value) 
(L) by 

(L) = I &f 3 bP d 3 x , (1.109) 

where (L) must be real. This results from the condition that 


Exercise 1.7. 


36 


1. Relativistic Wave Equation for Spin-0 Particles 


J $ t T 3 L'I'd 3 x = = J 


Therefore, 


A A. I IT def a I A 

f 3 L = L' f 3 or L h = f 3 L'f 3 = L 


( 1 . 110 ) 


which is the generalized hermiticity for an operator L. We immediately see that the 
hermiticity of the Hamiltonian Hf is general , because 


ft? = T3///T3 = f 3 


(f 3 - if 2 H h f 3 m 0 c 2 

2m 0 




P 1 

(f 3 + if 2 )- — + r 3 m 0 c 


f 3 f 3 = Hf 


(1.111) 


Nevertheless, the operator U (1.96) is ordinary hermitian (i.e. hermitian in the 
standard sense), i.e. 0^ = 0. 

The generalized scalar product (1.103), (1.109) necessarily leads to a transfor- 
mation law for the operators L when changing the states according to (1.95). We 
simply calculate 


[ &t 3 W' d 3 x = [ (p 1 * (U t 3 LU ~ l $' d 3 


/ 


^f 3 ULU~ l 4>' d 3 


= J rfhU&tfx 


using (1.102). Thus, 
£# = ULU~ l . 


(1.112) 


This is the transformation law for operators covering the transition from repre- 
sentation to $ representation (1.95). 

Remark : Notice that the transformation law for operators depends on the defi- 
nition of the scalar product. If, instead of the $ product (1.103, 109). we had used 
the ordinary > scalar product 

(&\V') = yVlP'd 3 * , 

with the matrix elements for operators L being 

(<f r |L|*f r/ ) - I , 

then the operator L in d> representation (we denote it by ZT) would have been 

J&tV d 3 x = J d 3 x = J d 3 x . 
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Thus, 

£4 = = U~ l LU~ x , 

i.e. a different law from (1.112). It is important that, because of physical reasons 
(charge interpretation), only the generalized scalar product ($ product) makes sense 
to us and thus the transformation law (1.112) will be used in the following. 


EXERCISE 


1.8 The Hamiitonian in the Feshbach-Viiiars Representation 

Problem. Show that, in the Feshbach-Villars representation, the Hamiltonian for 
all momentum eigenstates is given by H$ — U H( U ~ 1 = r^E p . 

Solution. According to (1.112) the Hamiltonian Hf of (1.68) for a free Klein- 
Gordon field transforms, under the transition to the Feshbach-Villars representation 
(1.95), according to 


H<p = 0 HfU~ l = 


(E p 4- m 0 c 2 )ll - (ra 0 <r - E p )f \ 


r> 2 

(f 3 + - + hmc 1 

2 mo 


\j 4m 0 c 2 E p 

(m 0 c 2 + E p )l + ( m 0 c 2 - E p )f\ 


y/4moc 2 E p 


If we define 


a + — niQC 2 4 - E p , a- — m^c 2 — E } 


P > 


a ~ nioc 


then it follows that 
1 


Hep = 


[a+ ~ a- f ,] 
[a + - a-fi] 


v 2 

- — (f 3 + if 2 ) + af 3 
2mo 


4m 0 c 2 E p 
1 

4moc 2 E p 

+ a- - — (t 3 T] + IT2T1) + aa + r 3 + aa-T 3 Ti 
2mo 


(a+ +a_fi) 


P~ 

<3+^ (f 3 + lf 2 ) 

2mo 


1 


4moc 2 E 1 


*2 
, 2 P 


P_ 

2 m 0 ' 


+ „ (T 3 +i t 2 ) + a+a-- — (f 3 f\ +if 2 fi) 

2 mo 

2 A , ^ „ P“ 


a2 


4- aal f? 4- aa_a_L_ tSti — — (tit* 4- ifif?) 

p 2 

— a 2 (tit 3 ti 4- ir\T2T\) — aa+a-T\T3 — aa 2 _t\fT > f\ 

2mo 
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Exercise 1.8. 


For momentum eigenstates the operator p can be substituted by its eigenvalue. 
Therefore, with r, f, = i£ ljk r l: (i -j- j -- 1,2, 3) one has 

f 3 + if 2 = +if 2 fi - 1 - TjT] — -ifif 2 - fiT3 

1 1 
1 -1 


-1 0 

0 1 


= -T1T3T1 - Vr\TzT\ 

and 

T 1 T 3 T 1 = -f 3 

as well as 
-fif 3 = 
and we get 

k 9 = 


0 1 
-1 0 


1 


4moC 2 E„ 


{* 


2 r_ 4 _ _ p 

to 

0 1 


pM 


, h a + a- -4 1- a_n+ 4 h at - — , 

2mo 2mo 2 m 0 2mo J 


+ (0 -l) { a l a + a - a } + ( o 

{ 


{2 aa^a+} 


4moc 2 E. 


f 3 1 (a+ + ajf— + (a? + a 2 _)a\ 

1 m 0 ^ J 


.2 P 


+ if^ < (( 24 - "h Cl~ )~ h 2 GCI-CI 4 . 

' ' 2rao 


Using the relation 

(a+ 4- a-y^— — 4mlc A ^— = 2moc 2 p 2 c 2 
2 m 0 2 m 0 


(a+ + a 2 .) a = 2moc 2 = (ra^c 4 + £^) = 2moc 2 [2 m^c A -F p 2 c 2 ) , 


2 <2<2_<2+ 
we obtain 


= 


1 


4moc 2 E p 


== 2moC 2 (mnC 4 — £^) = —2mQC 2 p 2 c 2 , 


f 3 • 4moC 2 Ep + if 2 • 0 ] = f 3 £ p 


From the free Klein-Gordon equation in Schrodinger representation (1.67) 


i h— = H& 

ot 


and by the application of U from the left it follows that 


i h 


dU& 

dt 


u fifty-' u& 
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Using the results of Exercise 1.8 and (1.95), we obtain the free Klein-Gordon 
equation in the Feshbach-Villars representation , i.e. 

d<P 

i h—=r 3 E p $ . (1-113) 

This equation yields two different solutions for any given momentum p, one with 
positive (+ E p ) and one with negative (—E p ) time factor. They are precisely the 
solutions (1.99) and (1.100), obtained by the direct transformation, and can be 
interpreted, according to the time factor, as belonging to positive charge or negative 
charge, respectively (see Exercise 1.9 for further details). 


EXERCISE 


1.9 Solution of the Free Klein-Gordon Equation 
in the Feshbach-Villars Representation 

Problem. Solve the free Klein-Gordon equation in the Feshbach-Villars repre- 
sentation (1.113) directly and show that the solutions are identical to (1.99) and 
( 1 . 100 ). 

Solution. The Feshbach-Villars representation of the Schrodinger equation reads 

lh s7 = ^ ' 

where the d's are eigenstates of the momentum operator 

'tV 


£> = expfip • rf h)9 


6 = 


^2 


= \! n 


mZc 4 + p z c 


2 r 2 


tty inserting the matrix f 3 we get 


i f 4 I =E„ 


v dt 

and, thus, 

\tvd\ = E p d 1 , ik'd 2 — —Ep'd 2 . 

Integration yields 6 \ and 62 is the form of 


tfi = N\ exp ( -jE p t 


■U 2 = N 2 exp yr-E p t 

N\ and iV 2 are normalization constants which are determined by the normalization 
condition ( 1 . 101 ), 
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Exercise 1.9 . 


yVfj$d 3 r = yVf 3 0d 3 r = ±1 , 

yielding 

i^i 2 -i^ 2 i 2 = ±i . 

Hence, we obtain two independent solutions [see (1.99) and (1.100)]. 


= ~jv (o) exp 


-) = _L f° 


-(p-r-E p t) 






exp 


h 


(p-r+ E p t ) 


charge 4- 1 
charge — 1 


Again each linear combination 

, with |rq| 2 — |/l 2 1 2 = 1 , 

is a normalized eigenfunction of the momentum p with charge 4-1, and each linear 
combination with |ni | 2 - Iml 2 = — 1 is a normalized solution with charge —1. 


We denote the solution (1.99) and (1.100) of the Klein-Gordon equation (1.113) 
for fixed momentum p (see Exercise 1.9) by 

P = {P<} = | —j^ n i [ n < = 0,±1,±2,...; A = 1,-1 . (1114) 

From now on we drop the index i on p i9 which has its origin in the box normal- 
ization, to simplify the notation, e.g. means ^ . The index A = 4-1 denotes 

a positive charge state whereas A = — 1 denotes a negative one. The states 
in (1.114) form a complete orthonormalized system. Hence, the following relation 
holds: 

f \T 3 $p '_ d x = A Sp y pt8\y , (1.115) 

where A, A' = ±1 and p , p f take the values given in (1.114). Let us now consider 
a general state $ containing particles with positive as well as negative charge. We 
can recognize and distinguish between the charged particles by expanding ^ in 
terms of 4> v ^\ to give 

& — ^~^ ^Ep,A^p,A ~ (^p, + l^p,+l T l^p,— l) > (1.116) 

p,A p 

where the expansion coefficient a v ^\ is given by 

Op , a = J ®l,\T3$d 3 X 


(1.117) 
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For the total charge we obtain 

e I &Ti$d 3 x = (l%,+i| 2 — |%,-i| 2 ) — . (1.118) 

v 

The total number N of elementary charges which are contained in the state $ 
can be positive, negative or zero. ^2 P \ a p,+n 2 ls ec l ua * 1° the total number of 
positive elementary charges and |a Pj _i| 2 equal to the total number of negative 
elementary charges in 


1.9 The Interaction of a Spin-0 Particle 
with an Electromagnetic Field 


the electromagnetic field is described by the four-vector [defined in (1.14)] 


= {A 0 ,A} = {Ao,A x ,A y ,A z } = g^A l/ , 
A p — Qfji I/A — ; A ]• 


In the case of nonrelativistic quantum mechanics we specified minimal coupling 
of the electromagnetic field, 

~ d ^ e 

E i h— eAo , p => —ih'V A , 

at c 

which can be compressed to the four-dimensional and covariant form as 

p* => P IX - or . (1.119) 


With the same minimal coupling, the free Klein-Gordon equation (1.21) is trans- 
formed into the Klein-Gordon equation with an electromagnetic field (compare 
with the later passage on gauge invariance of the coupling), 

{p* ~ {p* ~ ~ A ^ = nioc 2 ^ (1.120) 

or 


H ( is a^ - ^ 


'ih = mlc 2 ib , 


and, explicitly, 


i (ifi| - M„) i, = ^ (+i *2- + ^,) + %v) t 

= ( (+iW + ^A) + WqC 2 ) tp . 


( 1 . 121 ) 


( 1 . 122 ) 


In order to examine the charge and current densities, we start with (1.121), multiply 
by \p* from the lhs and subtract the complex conjugate. These operations can be 
symbolized by ijj*{xx) — ij}(xx)*, where (xx) denotes (1.121), and they result in 
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A 0 (r) 



Fig. 1.3. The Coulomb poten- 
tial of a negatively charged 
particle in the vicinity of the 
atomic nucleus 


0 = ^ 

- V 


= <r 






d ^ ie A 
dx v ' hc IXy 


a 


d ie 

+ t~A l 


dx ^ he 




ie M \ fd ie 

dx 1 ' he v ) \ x dx^ he 


r 


I 9 9 ,* ,* 9 9 1 I* 9 le A , 

-V> — — V’-V’ 


dx u dx' u 


dx u dx’ u 


dx v he 




he dx v 


= 9‘ 

One calls 




d ( , d ,* ,* 9 / 

— — ( xb xb — xb xb 

dx v V dx v dx v 


he " dx^ 


ihe f d d 

j» = ^zr ( zzz:^ - ^zzz:^ 


2niQ 


dx LJ 


dx 1 


m 0 c 


A u ipip* = {cq\ - j'} 


(1.123) 


the four-current density in the electromagnetic field A u . Obviously, the following 
relation holds: 


qW 9 j = 9 j p — Q 

9 dx^ 1 ' dx» J 


(1.124) 


This is a continuity equation for the four-current density, usually its meaning is 
that of a conservation law. Suitable normalization [like in (1.123)] yields charge 
conservation. Note that (1.123) is identical to the previous result, (1.34), if we 
exclude the term proportional to A v . To make this fact more evident we write 
explicitly, 


d = 


ihe 

2mQC 2 


xb*—xb — xb — ib* I — n 
dt dt ) m Q c 2 




and 


ihe e “ 

-—{xlfVxjj - xPVxP*) Aipr/j* 

2rao niQC 


(1.125) 


(1.126) 


and compare these expressions with (1.35) and (1.36). It is remarkable that electro- 
magnetic potentials appear in the charge and current densities (1.125) and (1.126). 
To obtain a deeper understanding of this fact, we consider a negatively charged 
particle in a central Coulomb potential, which is given by 


eA 0 (r) — Ze 2 V (r) , A = 0 


(1.127) 


V (r) ~ 1/r for large r and rounded off as for an oscillator within the range of 
the nucleus (see Fig. 1.3). A stationary state of the Klein-Gordon equation has the 
form 


t/)(tv t) = ^(r)exp(-i iet/h) , (1.128) 

where \e\ is the energy of the particle (see Exercises 1.10 and 1.11). The charge 
density (1.125) can be calculated as follows: 



1 ,9 The Interaction of a Spin-0 Particle with an Electromagnetic Field 


43 


„ , [e-eAo(r)] , , [e + Ze 2 V(r)] , , 

q ( r ) = e = 'i/j'i/j ( r ) = e = x/jifr (r) 

m 0 c z m 0 c 1 

Therefore, the charge density becomes 

g r > 0 for 5 > eAo(x) , ^'<0 for £< eA 0 (x) 


(1.129) 


(1.130) 


In the first case the charge density has the same sign as the charge, e, of the particle; 
in the second it is the other way round. The charge density has the opposite sign to 
the particle charge e, wherever the potential energy has values so that e < cAq(x). 
The physical meaning of this change of sign of q* in strong fields can only be 
understood within the frame of the field theory where the number of particles 
becomes variable. One can imagine that in the areas of strong fields, particle- 
antiparticle pairs will be produced and that the potential term in the charge density 
(1.129) may be interpreted as simulating such many-body aspects. 

Now let us carry over what we have said about normalization and the energy 
factor e for the Klein-Gordon equation to our present case. The charge normal- 
ization is carried out according to j g'(x)d 3 x — ±e (e being the charge of the 
electron). For tt - mesons (e < 0) we take the positive sign, for bound mesons 
we choose the negative sign. Hence, the charge density for tt~ mesons becomes 

, [e + Ze 2 V(x)\ . ,, icn . 

g (x) = e = %> (x)ip(x) . (1.131) 

moc- 

Figures 1.3 and 1.4 illustrate strong Coulomb potentials and the corresponding 
charge density. 11 



r [fm] 2 


Fig. 1.4. The radial density gr 2 of a 7r meson in the Is state of an exponential potential 
of the form V(r) = Zaexp(— r/a) (see Exercise 1.15). The range parameter a of the 
potential is chosen to be a = 0.2A, where = h/m^c is the Compton wavelength of 
the pion. The coupling strength is fixed by Z = 2158. Such an extreme central charge is 
chosen to make various new features of the Klein-Gordon equation especially obvious. The 
energy eigenvalue of the \s pion bound in this extremely short range and deep potential is 
E = —0.958027 measured in units of the rest energy m n c 2 of the pion. It is evident that 
the charge density is not positive definite, and that antiparticles can also be bound in this 
potential. The radial density of antiparticie in the same potential is shown by the dashed 
line - the energy eigenvalue of a bound antiparticle is E = — 0.983275 m n c 2 


11 See W. Fleischer, G. Soff: Z. Naturforsch. 39a, 703 (1984). 
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An interpretation of the positive part of the charge density is that, in strong 
fields, some fraction of 7r + mesons is always mixed with the tt meson. This indi- 
cates the great difficulties of the single particle interpretation in strong potentials. 


EXERCISE 


1.10 Separation of Angular and Radial Parts of the Wave Function 

for the Stationary Klein-Gordon Equation with a Coulomb Potential 

Problem. Separate the angular and the radial part of the wave function for the 
stationary Klein-Gordon equation for spherically symmetric Coulomb potentials. 

Solutions. We put eAo = V (r) and A = 0. Then the stationary Klein-Gordon 
equation reads 


[(e — V ( r )) 2 — ihqC 4 + h 2 c 2 V 2 ] ip(r) — 0 
or, explicitly, 


( 1 ) 


n 2 c 2 


1 8 


8 


r 2 8r l , r dr j r 2 sin 9 89 


8 


sine na ) + 

89 J r L sin 


8 2 


2 cin 2 $ 8(j> 2 


4>{r) 


= [(£ - V ( r )) 2 - mlc 4 ] ip(r) . 

For the wave function r/>(r) we make the following separation ansatz 
i>(r) = u(r)ne,<f>) . 


( 2 ) 

( 3 ) 


This leads to 

7 . 1 8 9 dti 

hC 7^d?'Tr + 




V(r)f - m 2 c 4 - 21 


u — 0 


1 8 8Y 1 8 2 Y xi , 

sin <9— + — + AT = 0 


sin 9 89 


89 


sin 


9d4> 2 


( 4 ) 


where A is the separation constant. The solutions of the last equation are the 
spherical harmonics T/ m with A = /(/ + 1), / = 0, 1, 2. . . and m = Oil, ±2, ±3 . . , 12 
With this the radial differential equation follows as 


1 8 

~r 2 8? 1 r 


,d_ 

8r 


+ 


/(/ + !)' 


/ , (£ - yf -mlc 4 

“ (r) = — m — u{r) 


( 5 ) 


Hence, using the common ansatz u(r) = R(r)/r , follows the transformation of (5) 
into 

d 2 1(1 + 1 ) l7 1 

! J72 Zi — + k R “ 0 ’ (6) 

where 


2^4 


2 _ (e - V(r)) 2 - m 2 c 
h 2 c 2 


( 7 ) 


12 See W. Greiner: Quantum Mechanics - An Introduction , 3rd ea. (Springer, Berlin, Hei- 
delberg 1994). 
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EXERCISE ■■■■■■■ 

1.11 Pionic Atom with Point-Like Nucleus 


Problem. Find the solution of the Klein-Gordon equation for the n~ meson in 
a Coulomb potential and discuss the energy eigenvalues. The pion has the mass 
m^c 2 = 1 39.577 MeV and spin 0. It obeys the Klein-Gordon equation. 


Solution. We use the result of the previous Exercise 1.10. The attractive Coulomb 
potential Ao(r) = —Ze/r is coupled as the 0 component of the four-potential in the 
Klein-Gordon equation (a = e 2 /he = 1/137.03602 is the fine-structure constant), 
giving 


d 2 _ /(/ + 1) - ( Zol ) 2 , IeZol 
dr 2 r 2 her 


2 4 

WQ C — E 

h 2 c 2 


21 


R,(r) = 0 


( 1 ) 


Now the energies and the wave functions for the bound states in the energy range 
— moc 2 < e < rtiQC 2 are to be calculated. Therefore, we first transform the above 
differential equation with the help of the substitutions 


P 

Q 


2 2~*/2 
_ o L m 0 C ~ £ J 

he 

= (3r with 0 < g < oo 


( 2 ) 


M = 



and 


x 2 Zae 

A= hc(5 


and obtain 


dp 2 


M 2 ~ 1/4 . A _ 1 
Q 2 + Q 4 


R,(g) = 0 


(3) 


In order to find an ansatz for the solution it is useful to first study the limits g — > oo 
and g — > 0. In the case g oo we can neglect the terms proportional to o ' 1 and 
g ~ 2 in (3) and, therefore, 


dg 2 


Rl(g) = 0 


(4) 


Its solution can easily be determined with the help of the exponential function 


Ri(g) = ae~ g/2 + be +e/1 

Q-+ OO 


(5) 


b — 0 follows from the requirement of normalization of the wave function. In the 
case o — > 0 one can neglect the last two terms in the above radial equation and is 
led to 


( d 2 _ ji 2 - 1/4 
\ d^ 2 g 2 


Ri(q) = 0 (for g -> 0) . 


( 6 ) 
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Exercise 1.11. 


With 


R{q) = clq v , 

(7) 

it follows that 


- 1 )Q V ~ 2 -a(n 2 - i) Q "~ 2 = 0 , 

(8) 

which is a determining equation for the power v, given by 


-H 

l<N 

II 

— I^f 

1 

<N 

+ 

-H 

— I<N 

II 

-H 

(9) 


Since /x can, in principle, take all positive values, and the wave function must not 
have any nonintegrable divergence at the origin, this unambiguously fixes vP 

*={+» . (10) 

Furthermore, we see that the radial function u(r) — R(r)/r has a singularity at the 
origin (r = 0) when / = 0, though this singularity can be integrated; therefore, the 
wave function can be normalized in spite of the singularity. This behaviour of the 
wave function for r — > 0 is quite novel compared to the Schrodinger wave function 
and characteristic of relativistic 5 wave functions in the Coulomb potential. Because 
of the definition of fx we further see that, for / — 0, only real wave functions can be 
found for Za < 1/2. For larger values of Z the parameter (x becomes imaginary. 

Because of (5) and (7) we now choose 

Ri(e) = Ne y2+ ^~ e/2 m (ID 


as an ansatz for the full radial equation, where the still unknown function f(g) 
should be constant for g — > 0 and should guarantee the normalization for g — ► oc. 
Inserting (11) into the radial equation (3) we find the following differential equation 
for f(g): 


d 2 / / 2 M + 1 

dg 2 V & 


n^_£L±i/^ /(e) = o 
d g q 


For simplification let us introduce the new abbreviations 
2/i + 1 = c , tx+ \ — A = cl , 
so that 


*L + (i - i 

cV 1 


d/(g) _ a = 0 
dp g J 


( 12 ) 


(13) 


results. 

Now we try to solve the differential equation for f(g) with the help of a power 
series expansion, 


13 This is not true in the special case / = 0. Here the solution with v _ can also be normalized, 
and one must set another criterion in order to exclude this case. For example, one can 
demand that an expectation value of the kinetic energy should exist. See Exercises 9.8 
and 9.9 in which the analogue problem for the Coulomb solutions of the Dirac equation 
is carefully discussed. 



1 .9 The Interaction of a Spin-0 Particle with an Electromagnetic Field 


47 


f(Q) = 'loan'd 1 ' . 

n' = 0 

Inserting (14) into (13) yields 

oo oo 

a n >n f (n f — l)g n ~ 2 + c ^ n , a n >g n ~ 2 

n' = 1 n' — 1 

oo oo 

- ^ n'a n <Q n '~ x - a ^ a n 'Q n '~ x = 0 . 

n' = l n'—G 

A comparison of coefficients of equal powers in o gives 
aao 

a i = 

c 

a\{a + 1) 

a2 ~ 2(c + 1) ’ 


(14) 


(15) 


(16) 


or, generally 

a m - 1 a + m — 1 

= ; r 

m c + m — 1 

Therefore, f(g) can be written as 


fie) = ao 


a a a + 1 p 2 

1 H — g H TT^t" ■*" 

c c c + 12 


OO 


n'—G 


(a)n' Q n ' 
(c)n' n*\ 


(17) 


(18) 


The function defined by the series (18) is called a confluent hypergeometric function 
\F\(a,c\ g). Note, there is always a generalized faculty function in the nominator 
and denominator of each term of the sum. However, the confluent hypergeometric 
function diverges for g — ► oo as 

iFi(a,c\g -> oo) => j^-g a ~ c e. Q . (19) 

r(a ) 

Thus, the normalization condition can only be fulfilled if it is certain that the series 
breaks off at a fixed value of nJ . If a +n f = 0 holds, i.e., if a is a negative integer, 
then all terms of higher order, m > n\ are also equal to zero. 

The energy eigenvalue for bound spin-0 particles in a Coulomb potential can 
be calculated from (13). Starting from 

1 r Zote 

X = „+-+n = kV ,_ £ 2 )I/2 • 

it follows that 


Exercise 1.11. 


£n'l 


-me 


1 T 


(Za) 2 

(«' + £ + [(/ + \) 2 ~ ( Za ) 2 ] 1 / 2 ) 2 


- 1/2 


( 20 ) 
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Thereby, we have chosen the negative square root, because in the case of no fields 
(Za oo) the free solution, i.e. the negative time evolution factors for negatively 
charged particles, must be obtained. The energy itself is given by 


-nl 


/ 


Too = niQC 1 


1 + 


(Za ) 2 


1 - 1/2 


(«' + \ + [(/ + \) 2 ~ (Za) 2 ] 1 / 2 ) J 
(see Exercise 1.12). Defining the principal quantum number as 


( 21 ) 


ti — n “l - / “l - 1 


( 22 ) 


we may rewrite (21) in the following form: 

(Za) 2 


-«/ 


m 0 c 


1 - 1/2 


1 + 


(„_/_!+[(/+ 1)2 .(Za) 2 ]!/ 2 )^ 
n = l,2,3... , / = 0, 1, 2, . . . (n — 1) . 


(21a) 


By setting N' — Nao , the eigenfunction can be written as 

R,(g) = iVV +1/2 e- e/2 ]F, + \ - A, 2 M + 1; g) = N'W^ig) , (23) 

where are called Whittaker functions. Expanding, finally, the above eigen- 

value in a series of powers of Z yields 


E n i = m 0 c { v 1 


rest 

energy 


Z 2 q 2 

Schrodinger 

eigenvalue 


Z 4 a 4 fa 3\ 

2n 4 \l + 1/2 ~4 ) + 

V ^ 

relative 

correction 


(24) 


The binding energy of a bound Klein-Gordon particle is defined as E\> — E — m^c 1 . 
Some numerical values for the binding energy of Is pions (n = 1, Z = 0) with 
mass m~c 2 = 139.577 MeV are listed in the following table. 


Binding energies of 1 a pions for various central charges Z 


Z 

E h l MeV 

Z 

Eb/MeV 

10 

-0.374 

50 

-11.51 

20 

-1.528 

60 

-19.391 

30 

-3.568 

65 

-26.34 

40 

-6.725 




From the energy relation (21a), one learns that for Za > 1/2, there is no energy 
eigenvalue for the 1a state (n = 1, l — 0), because the energy becomes imaginary. 
The energy eigenvalue for Is pions at Z = a _1 /2 « 68 are E n i = m 7r c 2 ( 2) -1 / 2 
(see next figure). The corresponding binding energy is about E b — —40 MeV. 

To determine the energy eigenvalues for .v states for which Za > 1/2, we must 
investigate the relativistic case of extended atomic nuclei. Therefore, in Exercise 
1.13 we will solve the Klein-Gordon equation for the Coulomb potential rounded 
off by an oscillator potential. Furthermore, one has to take into account that pions 
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Solution of the Klein-Gordon 
equation for a central Cou- 
lomb potential. The energy 
is given in units of the rest 
energy m n c l . For the \s level 
there are no solutions for Z > 
68, and, similarly, for the 2 p 
level when Z > (3/2)(l/a) 
« 205. This changes if the 
finite size of the central nu- 
cleus is taken into account 
(Example 1.13) 


and the nucleus interact strongly. Since the pion is 270 times more massive than 
an electron, its wave function already has a considerable overlap with the nucleus. 
Therefore, corrections to equation (21) are to be expected, based on the strong 
interaction, which can be quite massive. 


1.10 Gauge Invariance of the Coupling 

We know that electrodynamics is gauge invariant. By that we mean that the elec- 
trodyncamic laws (Maxwell equations) do not change under gauge transformations 

A'^x) = A^x) + . (1.132) 

The argument x represents all space-time coordinates, i.e. 
x = {x°, x\ x 2 , x 3 } = {ct, x , y, z } . 

We now investigate the consequences of the gauge transformation (1.132) for the 
Klein-Gordon equation (1.121), the transformation producing the change, 


d c 

^ 1 in 7T^ ~ ~K 

dx v c 


" ? A “ ^ = ° ’ 


or 


--a 

dx v c u cdx v 


... d e t 

in- /i, 

ox^ c ■ 




c dx » J 


yj — m^c yj = 0 


or 


<r in 


d_ 

V dx v 


lh ~i^ ~ ^ - m * C ^ = 0 ’ 


(1.133) 


with 


\e 


= exp ( • 


(1.134) 
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In other words the gauge transformation (1.131) changes the phase of the wave 
function only. The phase factor exp[(i e/hc)x\ is the same for all states 0. Since all 
physical observables are represented by bilinear forms of the structure 0* . . . 0 n , 
a common, identical phase factor does not play any role in the physics. Thus, we 
say that the Klein-Gordon equation is gauge invariant for the minimal coupling 



exp 




e dx 

c dx u 


0 


exp 



i h 


dx 1 


0exp 


le 

he 


X 


(1.135) 


and therefore also for arbitrary powers 


r/ e . \ n ,1 

( ie \ 

( * e . \ n 

( le \ 

(pu - -A v ) Ip 

exp Uv = 

( p - - ~c K ) 







= 

& - c A ") ] 



(1.136) 


and, consequently, also for arbitrary operator functions /(p„ - (g/cM,), which are 
expandable in power series of (p v — ( ejc)A v ), giving 





(1.137) 


Thus, minimal coupling is gauge invariant in this very general sense. The paren- 
theses [. . .] used in the above formulae shall help to illustrate the action of the p 
operators more clearly. 


1.11 The Nonrelativistic Limit with Fields 

We can study the nonrelativistic limit of the Klein-Gordon equation in the form 
of (1.122) with the help of the transformation 

0(x,f) = (p(x,t)sx p ( ~^m 0 c 2 t ) . (1.138) 

c p(x,t) characterizes the nonrelativistic part of the wave function, for which the 
relations 

<ra 0 c z M , \eA 0 p\ < m 0 c z \p\ (1.139) 

must be valid. The first expresses the smallness of the nonrelativistic energy 
(ih(d/dt) is the energy operator) compared to the rest energy; the second con- 
dition means that the potentials involved have to be flat compared to the rest 
energy: potentials that are too deep would lead to increased binding energies and, 
finally, to spontaneous pair creation (see Chaps. 12 and 13). This would make the 
nonrelativistic limit impossible. Next we find that 
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and 




d_ 

'dt 


• _ dA 0 .„ 4 dip . .„ , 

ft2 _ _ ,„™ 0 _ + inmoc - a 


a? 


i/ie^Ao + eA^ip - eA 0 m 0 c 2 </? 


+ i/wn<)C 2 — — _L \ / 1 


^ eA 0 (pm 0 c z + m^c ip j exp | --m 0 <rr 
ex P ( — T m o c2?N ) ( m o c4 ~ 2moc 2 eAo + 2moc 2 ih A 


dA 0 

With regard to (1.139), the small quadratic terms have been omitted in the last 
step. Inserting this into (1.122) yields 


exp ( --m 0 c z t J ^m^c 2 - 2m 0 eA 0 + 2m 0 ih ^ ^ 


,^d i he dAa 




= exp ( — -moc t 


+i ZiV + -Aj +mlc 2 


<P 


or 


■.dip 

xh Tt = 


1 / , t _ e \ 2 i he <9A 0 

__^V + -a) +eA 0 + — — 




p " e „ i & i he dA 0 

- A • p 4- eAn + (V • A) + 

2m ( ) m 0 c 2 m 0 c 2m 0 c 2 dt 


( p. (1.140) 


This is the Schrodinger equation for electromagnetic potentials. But be careful: In 
studying radiation problems we generally choose the Coulomb gauge , defined by 

div A — 0 


In this case the term proportional to dA 0 /dt must be included. Only in the Lorentz 
gauge , for which 


1 dAp 
c dt 


+ div A = 0 


is valid, do both of the last terms in (1.140) vanish. 


EXERCISE 


1.12 Lagrange Density and Energy-Momentum Tensor 

for a Klein-Gordon Particle in an Electromagnetic Field 

Problem. Specify the Lagrange density for a Klein-Gordon particle in the electro- 
magnetic field A ltl . Determine the canonical energy-momentum tensor. 
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Exercise 1.12. 


Solution. The Lagrange density for the coupled system of the Maxwell field and 
the Klein-Gordon field is 


£ = --F l 


\xv 


F + p - '( 

2mo LV 


i hd,, - 


e -^)r 


ihd^ — — m^c 2 ^*^ 


F IXV ^ fl A V UyA^ . (1) 

The variation of / = / £ d 4 x with respect to ip* yields the Klein-Gordon equation 
for a p) field, minimally coupled to the electromagnetic field, i.e. 



— niQC 2/ ip6'ip * \d 4 x = 0 


Using Sd^pj* = dfj.S'ip* and with partial integration of the first term, we obtain 
under the assumption that 8pi* vanishes at the boundaries of integration, 

J | ^-i hd^ - -A M j ihd M - -A M j p) - m\c 2 p?^ 6 pi* d 4 x —0. (3) 

Due to the free choice of Sip*, this yields the Klein-Gordon equation 

(p* 1 - (pH - ^ A n) = rn$c 2 ip . (4) 

Variation of I with respect to A /x yields the Maxwell equation in an analogous way 

=ju = jV (du + Jp A Pj -ip(d u - pp A Pj | • ( 5 ) 

The canonical energy-momentum tensor is calculated according to (1) from 


7^„ = 


dC a l t dC a , dC a A cur 

dvip + a , a _ L ^d v ip + a/a A ^ d u A e 8 V C , 


9{daip) 


d(duip*) 


d(dnA e ) 


giving 


= A-(-i hd u ip) (ihd^-ip* - ^ip*) 

+ -L(i hd v %i>*) f-i hd»ip - -A»ip) 
2mq V c ) 

= l 6^F ae F^-d l/ A e F^ + P- 
4 2 7#2q 


d v A e F^ - 8^L 


i hd“ — -A- u ) p)* y — \hd y lp) + \\hdylp*) 


(i - ^Ap 

X (-ihd 1 * - ^Ap ip-8» ( ihd e - i>* 

x i hd e — -Ap tp + 8^mlc 2 ip*ip 
= jW F °e F ° e ~ duA e F «* + . 


( 6 ) 


( 7 ) 


( 8 ) 


( 9 ) 
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Here represents the energy-momentum tensor for a Klein-Gordon particle 

with the minimal coupled electromagnetic field [last three terms of (1)], while the 
first two terms are standing for the free electromagnetic field. 


EXAMPLE 


1.13 Solution of the Klein-Gordon Equation 

for the Potential of an Homogeneously Charged Sphere 

We solve the Klein-Gordon equation for a sphere with radius a since we consider 
this model to be an improved approximation for a realistic pionic atom. The results 
are subsequently discussed. Inside the sphere the corresponding potential is given 
by 


eA 0 (r) = - 


Ze 2 
2 a 


3 


for r < a 


(i) 


This is an oscillator potential which yields, after insertion into the radial Klein- 
Gordon equation [see Exercise 1.10, (6) and (7)], 


dr 2 
— 2 


^ + ( 

( E 3 Za 

\hc^ 2 a 


+ 3Za > 
he 2 a 

\ _L 

J 2a a 2 


2„2 


m^c 

~¥ 


Za 
2 a 3 


2 


R(r ) = 0 


( 2 ) 


This may be further simplified by introducing the following abbreviations: 


E 3Za 
he ' 2a 


B = A 2 


2 2 

m o c 

K 2 


C 


Za 

2cfi 


( 3 ) 


For the total radial wave function, the following ansatz in the form of an infinite 
series is introduced: 


R = r l+l b n >r 2n ' . (4) 

n '= 0 

Insertion into the differential equation leads to 
52 V ( 2n ' + 1 + 1) (2»' + 0 r 2n ' +l ~ 1 

n ' 

- 52 / ( / + 1 ) bn ' r 2n ' +l ~ l + Bj 2 b n' r2n +I+l 

n ' n ' 

- 2 ACj 2 r 2n ' +,+3 + C 52^"- 2 "' +/+5 = 0 • ( 5 ) 

n ' n ' 


Exercise 1.12. 
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Ordering with respect to equal powers in r and comparing the coefficients, we find 


Bb 0 

2(2 / + 3) 5 

Bb\ — 2 A Cbo 
2(4/ + 10) ’ 

Bb 2 - 2ACb\ + C 2 b 0 
2 ( 6 / + 21 ) 


( 6 ) 


or, generally, 


Bb n t- 1 — 2A Cb n > _ 2 + C 7 b n '- 3 
4 ft '7 + (2n ; + l)2w' 


(7) 


The energy is determined by specifying that the wave functions and their first 
derivatives should be continuous at r = a. In the region r > a the wave functions 
are given by the Whittaker function Wx^g) with g = fir [see Exercise 1.11, (23)]. 
Thus, and by using (4), the energy can be fixed by 


£M 2 n' + / + 1 )a 2n ' +l 
W\^((3a) Y, b n'a 2n ' +t+] 

n' 


( 8 ) 


where 

fi — 2 (m^c 4 - E 2 ) 1/2 /he , 
A = ZaE ( mlc 4 - E 2 ) l ' 2 




{l + \ Y-iZefi 2 


, 1/2 


(9) 


For the point nucleus there exist bound solutions (for l = 0) only up to Z = 
l/2a « 68 . In contrast to this, equation ( 8 ), containing the finite extension of the 
nucleus, also yields solutions for larger values of Z (see figure below). Choosing 
the nuclear radius to be a ~ 10 fm = 10 - 12 cm, we get a nearly parabolic growth 
with Z for the binding energy. 

At Z — 1500 the energy eigenvalue for a Is pion approaches E — 0. For larger 
Z this value even becomes negative. At Z ^ ^critical ~ 3000 we find E = —m n c 2 . 
An extrapolation of this behaviour for still larger Z would make the state dive into 


Energy eigenvalues for a n~ 
meson in the Coulomb poten- 
tial of an extended nucleus. 
The charge distribution of the 
nucleus is assumed to be a 
Fermi distribution [see (10)] 
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the lower continuum of E < - m n c 2 . But at this point at least the single particle 
interpretation collapses totally because the new degree of freedom of pair creation 
has to be considered (see the later discussion, together with the Dirac equation, 
Chap. 12). One also has to take into account that, for pions (being bosons), the 
Pauli principle is not valid; thus, in principle, a state may be occupied by any 
number of pions. This is only true as far as the pion-pion interaction is neglected. 



The interaction among the pions will, in fact, inhibit the infinite spontaneous 
tt+tt - production. That this critical region, in which spontaneous pion production 
is possible, can be reached only for very large Z values may at once be estimated 
by considering the condition |V| > |£o| > 2m 7r c 2 . Only for Z ~ 1500 does the 
depth of the potential reach the value |V r | ~ 2m 7V c 2 ~ 280 MeV. In the above 
figure the energies for Is and 2 p pions, calculated for the potential of a Fermi 
charge distribution, are shown. The Fermi charge distribution has the form 

N 

ftmcleus(r) = 1 + exp(41n3[(r - c)/t]) ’ (10) 

with the constant of normalization N given by 


/*OG 

4n I g(r)r 2 dr—Z , (11) 

Jo 

and is shown in the second figure. Here c is the half-density radius. The surface 
thickness t characterizes the region in which the density drops from 90% to 10% 
of its value at the origin (r = 0). This also determines the constant, 4 In 3. The 
potential results as a solution of the Poisson equation and is given by 

V (r) = -47 re 2 I - j g(r / )r f2 d r l + j o(r ! )r ! dr'l . (12) 


The value E = —m n c 2 is reached at Z = 3280 for the Is state and at Z — 3425 
for the 2 p state. 


Example L13. 


Fermi charge distribution of 
a nucleus 
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1.14 The Solution of the Klein— Gordon Equation for a Square- Well Potential 


V 


R 

1 *r 


-V 0 


Squaro-well potential 


This situation corresponds to minimal coupling for V, which is described by (see 
figure on the left) 


V = 1 ~ V ° for r - R 
0 for r > R 


{ 


We set 


k, = + V 0 ) 2 - m%c 4 for r < 


R 


and 


k 0 — — mfic 4 for r > R . 


( 1 ) 


With this the Klein-Gordon equation reads 


_d 

dr 



- /(/ + 1 ) + k 2 r 2 


u(r) = 0 


( 2 ) 


The substitution g = kr leads to 


d d d 2 2 d 2 

57 =% a " d ir* =k dp ' 

Hence, one finally arrives at Bessel's equation 


} u 2 9 

2 Q— + Q — 1(1 + 1) + Q ) u(g) = 0 


d_ 

: d g 


( 3 ) 


The solution for r < R is u(o) = NjXkir). The Neumann function ni(k:r) must be 
excluded as a solution because it is irregular at g = 0. For V — 0, we always get 

«(<?) = Nj,(kr) with k = ~^/ £ 2 -m 0 2 c 4 

and where N is the normalization factor. Obviously, the solution is symmetric for 
positive and negative values of e. The Bessel functions can easily be calculated, 
and one gets 


. . . sin g sin g cos g 

J0KQ) = , J\KQ)=— • 

Q Q Q 

Furthermore, one can use the general recurrence relation 

fn-\(e) +fn+i(g ) = (2 n + 1 )g~ l f„(g) , (4) 

where /„ stands for the Bessel function j n , the Neumann function n n (g) or the 
Hankel functions 
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hn\@) —jniQ) + i«n(£>) or 

hn\o) = jn(Q) - in n (Q) . 


In the outer region we set 


2 4 2 

2 


h 2 c 2 


- -l 2 

= /Cq 


With the substitution o = lkr the differential equation (3) becomes 


(5) 


d 2 u „ dii 

d? + ^ + 


/a + 1) 

Q 2 


u(g) = 0 


( 6 ) 


and the general solution of this differential equation is 

fy (1) (e>) =Mq) + m(Q) • (7) 

We see that cannot be a solution for r > /?, since that function increases 
exponentially and so cannot be normalized. For an imaginary argument the Hankel 
function of the first kind is given by 14 




h?\ifir) 


h^fifir) = 


1 ( 1 

fir fi 2 r 2 

_1_ 3 

fir fi 2 r 2 


-0r 


+ 


/3-V 3 




( 8 ) 


Higher orders can be calculated by use of the above recurrence relation (4), and 
the derivative can in general be calculated to yield 


(2n + = nf n -i(z) — (n + l)f n+ i(z) - (9) 

dz 

The determination of the energy value e requires the equality of the logarithmic 
derivatives of the solutions at r — R, so that 


1 dwj 
u x dr 


1 d u 0 
u 0 dr 


for r = R 


( 10 ) 


where the normalization constants cancel. The eigenvalues e can be derived from 


HikR) h^'fikR) nn 

jiihR) hf\ikR) 

Now consider the special case of s states for the determination of e. For this case 
one obtains after insertion of the definition of jo and and calculation of (11) 


ki cot(kiK) = -k . (12) 


14 See M. Abramowitz, LA. Stegun: Handbook of Mathematical Functions (Dover, New York 
1965), p. 438. 


Example 1.14. 
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Example LI 4, 


The energy of a meson 
in the Is state as a func- 
tion of the nuclear charge Z. 
The radius of the nucleus is 
increased according to R — 
Yy V \ 1 / 2 where A = 2.5 Z is 
the number of nucleons in 
the nucleus 


The energy of a n~ meson 
in the Is state as a function 
of the nuclear charge Z. The 
radius was fixed at R = 3, 4, 
5 fm, etc. 


From this transcendent equation one can iteratively determine the eigenvalue e. 
Numerical solutions of that problem are shown in the following three figures. 

In the first figure the depth of the potential is set to V 0 = -Ze 2 /R, where 
the nuclear radius is R = r 0 A l/ ' 3 with r 0 = 1.2 fm and the number of nucleons 
A = 2.5 Z. At Z = 5150, e reaches the critical value £ cri{ = —m-c 2 . There the 
bound pion level dives into the lower (negative energy) continuum. 


m^c 2 

1001 

5o| 

0 

-50 

- 100 
- m n <? 





100 200 500 1000 2000 5000 Z 




In the second figure R was fixed (independent of Z). For smaller radii £ crit is 
shifted to smaller values of Z. 



In the final figure the radius is compressed to R = 0.25 fm. Furthermore, the 
charge symmetric solutions to the potential are plotted. These curves show reflection 
symmetry about the e — 0 axis. It is essential in this case that the curves exhibit 
“critical points” for R < 1 fm, characterized by the infinite slope of the binding 
energy as a function of Z. The first critical value is at e « — 0.8moc 2 . 

Here the solutions coming from the lower continuum into the region of bound 
states (7T+ states) meet with the solutions from the upper continuum ( 7 r _ states). 
This leads to the possibility of the spontaneous production of many tt+ — tt~ 
pairs (a true pion condensate). The deeper meaning of the phenomenon will be 
more clearly understood after the discussion of the phase transition of the electron- 
positron vacuum in supercritical fields. It is remarkable that a given short-range 
potential is able to simultaneously bind tt~ as well as 7 r + particles. This Schiff- 
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The energy of 7 r mesons as a 
function of the nuclear charge 
Z, if the radius R of the nu- 
cleus is compressed 


Snyder-Weinberg effect has its physical basis in the pionic charge density £>'(r), the 
definition of which includes positive as well as negative frequencies. Therefore, the 
pion is not a particle with a charge distribution of only one sign [see our discussion 
in connection with (1.129)]. 


EXERCISE 


1.15 Solution of the Klein-Gordon Equation for an Exponential Potential 


Problem. Solve the Klein-Gordon equation for an exponential potential of the 
form 

V(r) = —Zae~ r/a (1) 

with a = moc 2 e 2 /he. In natural units (a = c = mo = 1) a is equivalent to the 
Sommerfeld fine structure constant, i.e. a ~ 1/137. a characterizes the range of 
the potential. Restrict yourself to a states (l — 0) only. 


Solution. Under the restriction to j states, and, thus, the neglection of the centrifu- 
gal term, the Klein-Gordon equation assumes a very simple form. From (6) and 
(7) in Exercise 1.10 we have 


dr 2 


+k 2 


R(r) = 0 


( 2 ) 


with 

l2 _ [£- V(r)] 2 -m 0 v 

* h 2 c 2 

With the aid of the separation ansatz 
R(r) = e r/2a w(t) 
and the substitution 
t = 2\Za.^—zT r ! a , 


( 3 ) 

( 4 ) 


( 5 ) 
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we can transform the differential equation (2) into 


d 2 w f 

d^ + t 


1 i ea 1/4 — p 2 a 2 \ 

— — t i — : } w = u 

4 net t* ) 


( 6 ) 


with 


2 


mxc 


P = 


ftc 2 


(V) 


Equation (6) corresponds to the Whittaker differential equation 15 for which the 
regular solution for r — > oo (i.e. t — 0) is. thus, 


w(t) = NW x ^(t) = Nz-'/ 2 1 1/2+ ^ ,F, + n - A, 1 + 2 M ; t) , 


with 


x lew 


and 


( 8 ) 


( 9 ) 


p = pa . (10) 

N is the normalization constant. For the radial Is wave function we can write, in 
summary, 


R(r) = Ne r/2a W x>ti (2i Za^e"^ 0 ) . 


( 11 ) 


To obtain the energy eigenvalues we demand that R(r) vanishes at the origin 
(r = 0). Tnis is the only way to guarantee the normalization of the radial wave 
functions. Tnis condition leads to the eigenvalue equation 


~ + P ~ A, 1 + 2n: 2iZa— 


= 0 


( 12 ) 


which is an implicit equation for the determination of the energy eigenvalues e 
for s states, e can be obtained from (12) only by using numerical methods. The 
calculated energy eigenvalue e of an Is pion in an exponential potential are shown 
in the next figure as a function of the coupling strength Z. The range constant 
a of the potential is chosen to be a = 1 • where = h/m^c denotes the 
Compton wavelength of the pion. At Z cr ~ 778 the Is state reaches the boundary 
(e = —m^c 2 ) of the negative energy continuum. In a second numerical calculation 
the range constant is changed to a = 0.2 • A^. 

This causes a rise in the critical value to Z ^ 2158, which can be seen in the 
last figure of this exercise. Additionally, we find here, for the domain 2150 < Z < 
2158, a bound state for the antiparticle. The corresponding energy eigenvalues can 
be taken from the dashed line in the insert to the figure. The appearance of these 
bound antiparticle states is correlated with the short range of the potential and the 
fact that the radial density gr z is not positive definite. The radial densities are 
depicted in Fig. 1.3 following (1.131). 

15 M. Abramowitz, i.A. Stegun: Handbook of Mathematical Functions (Dover, New York 
1965). 
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The energy eigenvalue of a 
Is pion in an exponential po- 
tential as a function of the 
coupling strength parameter 
Z. The constant a character- 
izing the range of the poten- 
tial is a = IAtt. The criti- 
cal charge is Z cr — 778. E 
is given in units of the rest- 
energy m n c 2 


The same as in the foregoing 
figure for a = The 

energy eigenvalues of bound 
antiparticles are also calcu- 
lated. Z cr — 2158. The in- 
sert in the figure shows e for 
the domain 2150 < Z < 
2158. The dashed line char- 
acterizes the energy of the 
antiparticle state 


EXERCISE 


1.16 Solution of the Klein-Gordon Equation for a Scalar 1/r Potential 


Problem. Solve the Klein-Gordon equation for a scalar interaction of the form 

W(r) = -Za/r (1) 

which is coupled to the square of the mass. For the present let a be an arbitrary 
constant, the coupling strength is determined by Z. 


Solution. As an example of scalar coupling, we want to solve the Klein-Gordon 
equation for a long-range 1/r interaction. We introduced the Coulomb potential 
into the Klein-Gordon equation by minimal coupling (p^ > ( e/c)A M ). In 

contrast to this we now couple a scalar interaction U (r) to the square of the mass 
in the equation of motion, i.e. we perform the substitution m^c 4 — > m~c 4 + U 2 (r). 
A direct coupling to the mass would yield mixing terms of the form 2moc 2 U (r)\ 
however, we do not want to investigate this case in greater detail here. Thus, the 
radial Klein-Gordon equation with an arbitrary scalar interaction reads 


d 2 /(/ 4- 1) e 2 m-c 4 
dr 2 r 2 ' h 2 c 2 h 2 c 2 


U\r) 
h 2 c 2 


R(r) = 0 


( 2 ) 


The scalar interaction is independent of the charge of the spin-0 particle considered, 
i.e. it has the same effect on particles and antiparticles respectively. However, since 
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there is no experimental evidence for such a long-range interaction, our calculations 
are only of academic interest. Nevertheless, it is instructive to pursue the formal 
solution of the Klein-Gordon equation for this unusual type of potential and to 
determine possible critical values of the coupling strength. So we divide (2) by 
mlc 4 j'T&c 2 and substitute 


, ITIqC 

r = r- 


hc 


r ' is a dimensionless quantity. This yields 


(±. 

V dr' 2 


/(/ + 1) + £ : 


r /2 


mkc 4 


- 1 


U\r f ) 

mfic 4 


Now we specify the interaction by 
mic 4 r 


/?(/-') = 0 


(3) 


(4) 


(5) 


and, further, by 


b 2 = 1 - 


mZc 


2^-4 


( 6 ) 


Thus, we get 

(— 

\d r a 


i(i + i) 


r /2 


b 1 + -- ) R(r f ) = 0 


r' J 


Substituting again with 
q — 2 br' 
we now have 

/(/ 4 - 1 ) 1 


(*- 

W 


4 + f 


R(q) = 0 


with 


c — 


Za 

2b 


(7) 


( 8 ) 


(9) 


( 10 ) 


Now consider the asymptotics q — > oo and o — > 0. hor q — > oo it follows from 
(9) that 


^ i) R(e) = 0 

which yields immediately 
R(q) (x e -e/2 . 


( 11 ) 


( 12 ) 
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Analogously for g — > 0, 


( d 2 /(/ + 1) 
Vdf ? 2 *? 2 

with the solution 
R(Q) <X Q l + l 


R(q) = 0 


(13) 


(14) 


that may be normalized. Thus, for the solution we choose 

R(Q) = Ne l+l F(e)c~ e/2 , (15) 

where F(g) must still be determined. Putting (15) into the differential equation (9), 
we get 

+ K 2/ + 2) - 2 ] ^r [c - ( z + = 0 • (16) 

This is Rummer’ s differential equation , 16 with the solution 

F(q) — \F\Q + 1 — c, 2/ + 2, g) . (17) 


The confluent hypergeometric series allows normalization only if the first arguments 
equals a negative integer or zero. Hence, the condition for the determination of the 


energy eigenvalue is 

/ 4- 1 — c — —n r with (18) 

n r = 0,1,2,... . (19) 

With the definition of the principal quantum number 

n = l + 1 + n r , (20) 

we get from (18) 


In analogy to Schrodinger’s equation for a Coulomb potential the energy eigen- 
value £ does not depend on the orbital angular momentum quantum number l. 
Additionally, e is symmetric for particle and antiparticle states, as expected. The 
critical value for the coupling strength Z is easily determined; it follows that 

In 

Z cr = — . (22) 

a 

Inserting the value 1/137 for SommerfekTs fine structure constant, we get Z cv (ls) ~ 
274.07 and Z cr (2s) — Z cr (2p ) ~ 548.14, and for these values the derivative of the 
energy as a function of the coupling strength is 

16 M. Abramowitz, I. A. Stegun: Handbook of Mathematical Functions (Dover, New York 
1965). 


Exercise 1.16. 
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de 


dZ 


z=z cr 


= ±00 


( 23 ) 


In summary, the complete radial wave function has the form 
R{q) = Ng l+i ,F,(-n + / + 1,2/ + 2, p)e~ e / 2 


(24) 


where R(g) can be expressed by the generalized Laguerre polynomials 


R(e) = Ne ,+l {n (n l +l) ] V ' (2l + e~ e/2 . 

The normalization factor N of the radial wave function is determined by 


(25) 


/' 


g(r)r 2 dr = ±l , 


(26) 



with the radial density 


e(r)r 2 = eR 2 {r) 


(27) 


The plus sign in (26) is valid for those states which with decreasing coupling 
strength enter from the upper continuum ( e > moc 2 ) into the energy domain of the 
bound states, while the minus sign is valid for antiparticles which enter from the 
lower continuum (e < —moc 2 ) into the domain of bound states. The evaluation of 
the normalization integral yields 


N = 


(n ± /)! 1 

2\e\ n(n-l - l)i (21 + 1)! 


( 2 pf /2 


(28) 


The above figure shows the energy eigenvalue of the Is state and the 2s state as 
a function of Za ~ 1/137. The values for particle (straight line) and antiparticle 
(dashed line) states are given. e(Z) shows the square root behaviour due to (21). 
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EXAMPLE 


1.17 Basics of Pionic Atoms 

Pionic atoms consist of a nucleus and one (or more) 7r” mesons which “circle” 
around it. The pions are generated in an inelastic proton scattering process (e.g.: 
p-bp— >p-bp-b7r“-b 7T + ), slowed down, filtered out of the beam and are in- 
cident on the elements under investigation as slow pions. Passing near an atom, 
the pion is captured by a simultaneous emission of electrons (Auger capture). The 
probability that two or more pions are simultaneously captured is extremely low. 
Thus, in general pionic atoms have only one pion, as occurs similarly with muonic 
atoms. Since pions have spin 0, they are described by the Klein-Gordon equation. 
The interaction of a pion with a nucleus consists of two essential parts: first the 
electromagnetic interaction, which is described by the Coulomb potential Ao(jc), 
and second the strong interaction. The latter consists of a real and an imaginary 
potential. A pion interacts with the nucleons in the nucleus according to the reac- 
tions: 

7r“ + p n , 

tt + + n p , 

7T -b 2N — > 7T -j- 2N . (1) 

For example, a tt“ can completely disappear in a nucleus if it is caught in the 
conversion of a proton into a neutron. This capture process leads to an imaginary 
potential, by which it is globally simulated. Another possibility is a kind of “chain 
reaction” of the form 

7T~ -J- p — > n — > 7T~ -J- p — > n — > 7T~ p — > . . . . (2) 

We abbreviate this - as above - by 7r + 2N — > tt -b 2N. This chain reaction leads to 
an effective, real, optical potential for the pion, which is known as the Kisslinger 
potential , 

V? ssl (x) = A 0 q(x) — Ai V • (Q(X)V) . (3) 

An improved form of this potential, the Ericson-Ericson potential , is given by 

, (4) 

where g(x) denotes the nucleon density and Aq, A\ and a are constants which 
are fitted to a large number of nuclei ( 7 r atoms). The imaginary potential is also a 
function of the nucleon density, namely, 

^ mag - (*) = i (B 0 q 2 (x) - B x V • {q\x) V) ) , (5) 

where Bo and B\ are again constants. 17 

17 See, for example, Y.N. Kim: Me sonic Atoms and Nuclear Structure (North-Holland, Ams- 
terdam 1971); J.M. Eisenberg, W. Greiner: Nuclear Theory, VoL2: Reaction Mechanisms, 
3rd ed. (North-Holland, Amsterdam 1987). 
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-h 

h 


-f — i 
+-+ 


/ t T ( { 


t t - f - t - 


_3 d 
— 3 p 
-3s 

-+~2 p 

+~2s 


/ / / / / / / / / / hhh- 1 6 

Schematic spectrum of a pi- 
onic atom. The hatching in- 
dicates the broadening of the 
inner levels due to pi on cap- 
ture by the nucleus. The 
3 p 2s — and 3d — » 2s — 

radiative transitions are also 
shown 


Because of these nuclear potentials, which we derive and confirm more rig- 
orously in Examples 1.22-24, there are deviations from the spectrum of a pionic 
atom, calculated on the basis of the Coulomb potential for a point nucleus in Ex- 
ercises 1.11 and 1.13. Since the Kisslinger potential is short range [its range is 
reduced to the nucleus because of the dependence on g(x)] 9 then mainly the Is, 2s 
and 2 p states are influenced, because only the wave functions of these inner states 
overlap considerably with the nucleus. Finally, the imaginary potential generates a 
broadening of these inner states, which expresses the pion capture by the nucleus. 
The figure on the left illustrates the situation qualitatively. 

Behaviour of Solutions of the Kiein-Gordon Equation 
Under Lorentz Transformation 

Our acceptance of the special theory of relativity requires that relativistic wave 
equations nave to be form invariant under Lorentz transformations, i.e. 

x'"' = a lrl y x u , (6) 


with a^ y a u a For further investigations of the wave function ^ it is con- 

venient to use the covariant notation (1.21) of the Kiein-Gordon equation (see 
Exercise 1.5). Since the square of the four-vector p“, i.e. p 2 — does not 

change under Lorentz transformations, one can see that the wave function is just 
multiplied by a factor of absolute value 1 in these transformations. In other words, 
in the case of the coordinate transformations (6), which are simply abbreviated by 

x — > x' = ax , (7) 


the transformation law of the wave function of the Kiein-Gordon equation (1.21) 
reads as 


-0(jc) — ► x ') = A xp(x) 


( 8 ) 


with |A| = 1. If the Lorentz transformation is a continuous one (rotation through 
an arbitrary angle in the four-dimensional space), i.e. if the transformation matrix 
a^ y depends continuously on variable parameters ai,a 2 , . . then the factor A is 
equal to one, because A = 1 holds for the identity transformation characterized by 

OL\ = 0-2 = • • • = 0 . 

Let us now consider space inversion, which is a discrete Lorentz transformation: 


Two-fold performance of space inversion leads to the identical transformation. 
Since the wave function should be unique, one has 

A 2 = 1 or A = ±1 . (10) 


This means that in the case A = 1 (8) yields 

Ip'( x', t ') = t ) = 1 p(x, t) . (11) 

Thus, the v function is then a scalar. For A = — 1 is 

ip'ix'it') = = -Ip(x,t) , 


( 12 ) 
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and one is dealing with a pseudoscalar wave function. We conclude: solutions 
ip(x) of the Klein-Gordon equation are either scalar or pseudoscalar functions, i.e. 
functions which are both invariant under spatial rotations and proper Lorentz trans- 
formations and also which are invariant (scalars) or change sign (pseudoscalars) 
under space inversions. 

The transformation laws of the wave functions (8) are an essential characteristic 
for the properties of particles described by the Klein-Gordon equation (the same is 
true for particles which satisfy other wave equations). Wave functions which do not 
change under spatial rotations describe particles with spin 0, which is, so to say, a 
group theoretical argument 18 for the fact that the Klein-Gordon equation describes 
particles with spin 0. The absence of level splitting due to spin interactions in 
pionic atoms (no 2 p x / 2 — 2p 3 / 2 , 3 d 3 / 2 — 3 d 5 / 2 , . . . splitting, but only 2/?-, 3d — , . . . 
levels) uniquely leads to the conclusion that pions have spin zero. It has turned 
out that the K mesons (wk ~ 960 m e ) also have spin zero. The question whether 
pions are described by scalar or pseudoscalar wave functions has to be decided 
experimentally. 

The Pion’s Pseudoscalar Character 

To complete our knowledge of the pion inner wave function, people have searched 
for the creation of two neutrons in the capture of a slow 7r“ meson by a deuteron 
and this reaction has, indeed, been observed. We now show that this proves the 
pseudoscalar character of the pionic wave function. The first phase of the process is 
the creation of a 7r-mesic deuterium atom in the Is state. The spins of the deuteron 
and the ir~ meson are 1 and 0, respectively. Therefore, the total angular momentum 
of this starting state, i.e. the pionic deuteron atom in its ground state, is 1. Its parity 
is equal to the inner parity of the pion, because the inner parities of both nucleons 
(which make up the deuteron) can be assumed to be equal, and the parity of the Is 
relative wave function equals 1. In the final state two neutrons have been created 
and the pion has been absorbed. Due to Pauli’s principle a system of two neutrons 
can only be in the following antisymmetrical states (pay attention to the spin!): 

1 So 3 Po 3 Pi 3 p 2 1 d 2 ... • ( 13 ) 

During this reaction parity and total angular momentum are conserved. Since the 
total angular momentum in the starting state was 1, only those states shown in 
(13) which have spin 1 can appear in the final state. The only one satisfying this 
demand is 3 r\ with L = S = J = 1 and, because of L — 1, this state is of negative 
parity P . Consequently, the starting state of the reaction is of negative parity, too. 
This is only possible with a negative inner parity of the pion. Panofsky et al. 19 
verified this with the observation of the reaction 

7r~ H- d — > 2n . 


18 See the detailed discussion of this point in W. Greiner, B. Muller: Quantum Mechanics - 
Symmetries , 2nd ed. (Springer, Berlin, Heidelberg 1994), Chap. 1. 

19 W.K.H. Panofsky, R.L. Adenot, J. Halley: Phys. Rev. 81, 565 (1951). 
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1.12 Interpretation of One-Particle Operators 
in Relativistic Quantum Mechanics 


This a rather general subject. However, in this section we want to study the possi- 
bilities of measuring eigenvalues of one-particle operators, especially for the case 
of spin-zero particles described by the Kiein-Gordon equation. This point is rather 
important, because we will see that the naive one-particle interpretation encounters 
problems. We have already seen this fact in the charge distribution (1.131) of a 
pionic atom. Nevertheless, we shall try to maintain the one-particle interpretation 
as far as possible. 

Let us return to the Schrodinger form of the free Kiein-Gordon equation , 

09 

ih—=H { V , (1.141) 

with 

*<*■') =(S!) < u42) 

and the Hamiltonian 

Hi = (f 3 + if 2 ) f— + moc 2 t 3 . (1.143) 

2 m 0 

With the help of (1.141), the column vector 9(x,t) may be evaluated at any later 
time r, if the values &(x,0) are known at t = 0. This may be expressed by the 
transformation 

&(x,t) = S(t)9(x,0) . (1.144) 

The transformation operator reads 

5(f) = exp(-^ = l + ^-^)f+^-^) (1.145) 

and is P unitary ’ because of the property 

. (1.146) 

As in nonrelativistic quantum mechanics, time dependence in the relativistic case 
need not be expressed by the state vectors t) {Schrodinger picture) but can also 
be incorporated in the operators. This picture, where the operators and not the state 
vectors are time-dependent, is referred to as the Heisenberg picture. The change 
from the Schrodinger to Heisenberg picture is performed by the transformations 

= S~\t)]p(x,t) (1147) 


5 H (f) = f 3 S^(t)f 3 = exp 


and 


P„(t) = S~ 1 (t)P(0)S(t) 


(1.148) 
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For the scalar products it yields 

(nx,t)\F(0)\r(x,t)) = {S(t)^ H (x)\F(0)\S(t)^(x)) 

= (V H (x)\S u (t)F(0)S(t)\tP' H (x)) 

= {'P H (x)\F H (t)\'P{ i (x)) , (1.149) 

i.e. the change from the Schrodinger to the Heisenberg picture leaves the ^ scalar 
product invariant. In doing so we have made use of (1.146). From (1.148), it follows 
directly that for time independent Hf , 

i h— = ih- fc&t/hpi 0)e-i^/ft\ 

At At l J 

= -HfF + FH { = [F,H f ]_ , (1.150) 

in analogy to nonrelativistic quantum mechanics. 20 From this relation it follows that 
the physical observables F, whose corresponding operator F commutes with Hf, are 
constants of motion. This means that the expectation values of these operators are 
constant in time. One of the basic postulates in nonrelativistic quantum mechanics 
states that the eigenvalues of an operator describe the measurable values of the 
corresponding classical observable (physical quantity) in a state of the system. To 
satisfy this postulate in the relativistic theory we must modify the definitions of 
some of the operators. 

We illustrate this for the well-known example of energy. The eigenvalues and 
eigenstates of the operators Hf (1.143) are determined - in the case of free motion 
with momentum p - by the equation 


H f w=Ew . (1.151) 

We know from our earlier discussions [see (1.74-79 and 114)] and from Exercise 
1.9, that (1.151) has two solutions 

¥ x (x) = -j= e ip - x/h , A = ±1 , (1.152) 

VU \XoxJ 

for the corresponding “energies” 

E x = A E p = A cxjp 1 + title 2 . (1.153) 

E - 1 is negative and, therefore, we cannot interpret it as a one-particle energy, 
which must always be positive. Here we have a remember the double meaning of 
the energy eigenvalues of the Hamiltonian in nonrelativistic quantum mechanics: 
first they represent the energy of stationary states and second they characterize the 
time dependence (time evolution) of the wave functions. We have already learned 
that the eigenvalues E\ of Hf also represent the time dependence of the wave 
functions in the relativistic theory [the time factors exn(±i E p t/h) in (1.78) and 
(1.79)]: 

20 See W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Hei- 
delberg 1994), Chap. 15. 



70 


1. Relativistic Wave Equation for Spin-0 Particles 


#a(x, t) = exp (-iftft/h) &\(x) = exp (-iA E p t/h) &\(x) 
= \P\(x)exp (— iA E p t/h) 


(1.154) 


The energy of these states is always positive and, hence, A independent. Previously, 
we derived this result by using the canonical formalism [see (1.61)]. We can also 
see this from the following statement: The energy e of a system in a stationary 
state is identical with the mean value of the energy, i.e. 






(1.155) 


We proved this in Exercise 1.7 by making use of the canonical formalism. Now 
with 


H f &\ = E\& x = ^Ep&\ and / ^{f^x d 3 x = A 
we have 

£a = A Ep J &lti'Z r \ d 3 x = A 2 Ep = Ep . 


(1.156) 


The energy is always positive and independent of A. Thus, the problem of the 
energy is solved. To resume: The dual character of the eigenvalues of Hf , i.e. as a 
characteristic factor of the time evolution and as an energy, evolves quite naturally 
in the relativistic quantum theory. We can give the correct interpretation of the 
energies of the states by making use of the canonical formalism. Hence, the energy 
operator is not Hf but f 3 # f [see (1.155)]. 

In nonrelativistic quantum mechanics there is always a correspondence between 
a relation of operators and that of classical objects (measurable values). For exam- 
ple, Newton’s classical equation of motion corresponds to the operator equation 


dp 
d t 




with H — p 2 /2m + U(x) (Eh ren Test’s theorem). Another example is given by the 
operator relation 


dx 

~di 



V_ 

mo 


(1.157) 


which corresponds to the classical relation between the velocity and linear mo- 
mentum. Because these operator equations are of the same form as the classical 
equations, it is certain that the quantum-mechanical mean values satisfy the classi- 
cal equations of motion. 21 In relativistic quantum theory the situation is different. 
For instance, in the last example the expression for the “velocity operator” of a 
relativistic spin-0 particle was 


dx 
d t 


m I 1 ’*'] 


(?3 + i T 2 ) — 

mo 


(1.158) 


21 See W. Greiner: Quantum Mechanics - An Introduction , 3rd eu. (Springer, Berlin, Hei- 
delberg 1994). 
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while the classical relativistic velocity is given by 
dx p c 2 p c 2 p 

d7 ~ M ~ Afc 2 ~ ~Y 

where M — m^j{\ — v 2 /c 2 y /2 denotes the relativistic mass, i.e. 


(1.159) 


E - Me 1 = 


me* 


mqc 


= c 1 


= C\ 


yl — v 2 /c 2 \J 1 - v 2 /c 

Im^v 2 + WqC 2 (1 — v 2 j c 2 ) 

1 — v 2 jc 2 


I v 2 ( v 2 

T + ( 1 T 


rriQV 


(1 — v 2 /c 2 ) 1 ' 2 


+ m^c 2 — c\jp 2 + mlc 2 


is the total energy of a free particle with rest mass mo. Obviously, the rhs of (1.158) 
and (1.159) are different. Furthermore, we notice that the eigenvalues of the matrix 


f 3 + if 2 


1 1 
-1 -1 


are zero! This also means that the eigenvalues of the velocity operator (1.158) are 
zero, too. Hence, we again notice that, in general within a relativistic theory, the 
eigenvalues (expectation values) of a reasonably constructed operator are not the 
same as the values of the corresponding classical quantity. Therefore, we conclude 
that not all operators of the nonrelativistic theory can be transferred to the rela- 
tivistic one-particle theory. The reason for this is the restriction to the one-particle 
concept. In relativistic quantum mechanics the consistency of the one-particle de- 
scription is limited. This may be specified more precisely: From a mathematical 
point of view the formulation of the relativistic theory within a one-particle concept 
implies the condition that the only valid operators are those which do not mix dif- 
ferent charge states. Such operators are called even operators or true one-particle 
operators. More formally, an operator [F] is called even , if 

[F] & i+) = ^ ,(+) , [.F] !^ ( - } = !p r,( - ) (1.160) 

is valid. are functions with positive and negative frequencies, respectively. 

Similarly, an operator {Fj is called odd if it satisfies the conditions 

{F} £' (+) = l F /(_) , {F} , F (_) - l F /(+) . (1.161) 

Therefore, the Hamiltonian of the free Klein-Gordon equation in the Schrodinger 
representation Hf and the momentum operator p = — i.W are even operators. This 
means that 

tif=[6f] , P = [p] • d- 162 ) 

An operator can generally be split into an even and an odd part, e.g. 

F = [F] + {F} . (1.163) 

Therefore, one can separate from any given operator F a true one-particle operator 

[F]. 
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The investigation of even and odd operators can be simplified in the Feshbach- 
Villars representation, especially if one uses momentum eigenfunctions (p repre- 
sentation). In the representation the wave functions of the two charge states are 
given by (1.99) and (1.100), i.e. 


# (+) (p) = 



& ) (p) 



exp 

exp 


ji p-x-E p t ) 
p-x + E p t ) 


(1.164) 


In this representation an even operator is diagonal because of the column vectors 
(q) and (Y) for the two charge states. Therefore, we can deduce that //<;, is an even 
operator, because 


H& = t 3 E„ = 




(1.165) 


Also, for <P(p) (1.164), the momentum operator p is diagonal, since with (1.96) 
and (1.112), we find that 


p * - upu~ l =p 


1 0 
0 1 


- V = [p] 


(1.166) 


An even operator has to be a diagonal matrix in the <I> representation; therefore, it 
will be especially simple to separate the even part of any operator F, 


F<p = 


Fu 

F21 


Fn 

P 2 2 


(1.167) 


according to 


= 


Fu 0\ / 0 F n 

o f 22 ) + \f 21 0 


= [F*] + {F*} 


Having completed the introduction to odd and even operators, we now apply this 
method to the x operator which caused problems in (1.158) and (1.159). In the p 
representation the x operator is given by 

~ 9 d 9 1 

- <ll68) 

which can be calculated to be 


= <U69) 

in the ^ representation (cf. Exercise 1.18). Since f\ is nondiagonal, the true one- 
particle position operator in ^ representation reads 


[x<p] — i^Vnll 


(1.170) 


Clearly, this is the canonical conjugate operator of the momentum operator since, 
due to (1.166), 
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[[£*], , [p]j. 


— i h 


d 

opt 


— ihStj , 


(1.171) 


as it should according to quantum mechanics. 22 With the help of the true position 
operator (1.170) we can calculate the velocity operator : 


£ [a*] - = [v„i ,t 3 E p \_ 


i h 


T3 


'V p ,cJp 2 + m$c 2 


= Ti 


cp 


ft 


T3- 


2 

c L p 


2 + m$c 2 E p 


with eigenvalues 


2 2 

c p , cp 

H and 

E P E p 


(1.172) 


(1.173) 


We have already stated the corresponding eigenfunctions in (1.164); therefore, we 
get 


^Ii£V>(p) 


““T3<P (±) (p) = ± 
En 



$ (±) (P) 


(1-174) 


EXERCISE 


1.18 Calculation of the Position Operator in the Representation 


Problem. Show that the x operator in # representation is given by 


Xrh = i hVpt — 


ihpf\ 

2 (p 2 4- mlc 2 ) 


Solution. We know that in the Schrodinger picture the position operator in mo- 
mentum space is just the derivative with respect to p, i.e. 

x — ihVp . 

By substituting the Feshbach-Villars representation, all operators change according 
to (1.112); hence, 


x& = LJ(ihV p )U 1 , where 

A _ (me 2 + E p ) H - ( m 0 c 2 - E p ) n 
y/4moc 2 E p 


^ See W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Hei- 
delberg 1994). 
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Exercise 1.18. 




Therefore, we get 

(mpc 2 + Ep) l- ( m 0 c 2 - E p )f x 

sj4mpc 2 E p 

(mpc 2 + E p ) H + (m 0 c 2 -E p )f\ 
^j4mpc 2 Ep 


(ihVp) 


Since 


e p = \Jml cA +p 2 c 2 , and V p E p = (p • V p ) p = , 


we get 


1 


v p t/-' = ■ , 

yj 4niQC 2 Ep 


a - f x )v p e p - o 


i 


2(4 mpC 2 Ep) 


4 mpC 2 V p Ep 


1 <i 


yJ4mpc l E p 


pc 

'lE 2 


{(11 + T\)mpc 2 - (11 - f\)E p } 


\J 4nipc 2 E p 


pc 2 f\ \ (mpc 2 + E p )H+ ( mpc 2 - E p ) t\ 


2 E 2 


y/4m 0 c 2 E p 


-U 


-\ P C T i 
2E p 


If x& acts on $ the result is 

= u (i hv p ) u- l $= u [ihVpU- l ]<P + uu- l mVp$ 


+ 00~ l ihV p $ 
2E p 

ihV fP^ r, \ * , 

2 [mfic 1 + p l ) j 


as required. 


For states of positive charge (time evolution factor E = +E P ) the same re- 
lation between velocity and momentum holds as in classical relativistic mechan- 
ics, (1.159). For states of negative charge this is true only for the absolute val- 
ues. Antiparticles with momentum p move in the direction opposite to p. This 
is not unreasonable, especially if we think of the current density of the charge, 
e(d[x$/dt]) — e(c 2 p/E p )f 3 , which must change its sign for antiparticles because 
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of their negative charges. Therefore, the operator [x<p] fulfills a number of plausible 
conditions and can be accepted as a true one-particle operator. Since the position 
operator [x&] and the charge operator Q& — e f 3 commute, there exists a set of 
simultaneous eigenfunctions $^(p) with 

[x$] ^fVp) = ihVpQ^ip) = x$^(p) , and (1.175) 

Q^f\p) = ±e$f\p) . 

The ) (p). which are normalized to <5 functions, are given by 

^ +)<p> = 7(S^(») exp<ip ' x/ft) ' 


^ ><p) = TcW (°) “ p(ip ' x/K> ' <U76> 

This is the — p representation. In order to derive the E — p representation (i.e. 
the usual Schrodinger momentum representation) we must apply U~ ] to &±\p) 
according to (1.95), so that 


&±\p) = V 

= (woe 2 + E p )l + (m 0 c 2 - E p ) f, 
yf4moc 2 E p 


(moc 2 + E p ) (q " j + (m 0 c 2 - E p ) ^ 

V 4 m 0 c 2 E p ^ m 


(1.177) 


where U 1 is given by (1.97). This can immediately be calculated, yielding the 
explicit form 


= — = ' 

x ^ y/2 1P& V 4m 0 c 2 E p 


m 0 c 2 ± E P 
jtiqc 2 =F Ep 


exp (ip • x/h) 


(1.178) 


As we have already noted these are the eigenfunctions (1.176) in Schrodinger 
momentum representation. We perform the transition from p representation to x 
representation by 23 


^x ±) (* / ) = J {*'1 p)y ( ±\p)<£p 

= f «P(-^Vft) e)(p)d 3 

J 

1 1 f f m 0 c 2 ± E p ^ exp(— ip • (x' - x)/h) ^ 

(2 nh) 2 ^ / 4niQC 2 J \ m o c 2 -F yf^p 


23 See W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Hei- 
delberg 1994), Chap. 15. 
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A lengthy calculation, which we shall perform in Exercise 1.20, yields 

fA±B\ ~ (l/z 1/A ± l/z 9 ^\ _ z 
[ a ^ bJz^i ( l / z 7 / 4 t 1 / z 9 / 4 ) 6 

where 



(1.180) 


_\x - x'\ 
h/moc 

The meaning of this result is that the eigenfunctions of the true position operator 
[x] in Schrodinger position representation are not 6(x - x'), as we would have 
expected from nonrelativistic quantum mechanics, but a kind of smeared-out 6 
function. According to (1.180) the functions are smeared out over a region of the 
order 


z ~ 1 , implying \x - x'\ ~ , 

rrioc 

i.e. the dimension of the Compton wavelength of the particle. 


(1.181) 


EXERCISE 


1,19 Calculation of the Current Density in the ^ Representation 
for Particles and Antiparticles 

Problem. Calculate the current density (1.36) in the # representation for particles 
and antiparticles. 

Solution. By means of the Klein-Gordon field the current density reads 

& = (^) » v + x = , 

3 - {^^3 (T 3 + ir 2 ) V* - (V\P*)t 3 (f 3 + if 2 ) &} , 

which can be verified by direct calculation, 
fl^T 3 (f 3 4- if 2 ) Vtf' — (V>P , ^)f 3 (f 3 + if 2 ) 



= {v* + X*) V (<p + x) - v (c p * + X*) -(<p + x) 
= - V’VV’* . 


Using 

& = U~ x $ , & = , 


1.12 Interpretation of One-Particle Operators in Relativistic Quantum Mechanics 


77 


where U is given by (1.96), and, because of 

(t/t)- 1 = (f 3 u- x t 3 y l = tyuty = r 3 ut 3 , 

<f rt (l + Ti)V^ = ^ t f 3 t/f3(ll + fi)t/ _1 V<? . 

Furthermore, r^{r^ + ir^) = 1 + f\ is valid. We commute U with to give 

(E p + m 0 c 2 ) H + fi (E p - m 0 c 2 ) ^ 

" T! - T! 

_ ^ (E p + mpc 2 ) 1 - (E p - m 0 c 2 ) t\ 
x /4m 0 c 2 E p 

— ? 3 U~ l , 


and we 


get 


^(1- 


l + f,)V!P , = ^ t (ll + f 0(f) -1 ) 2 V<2> , 
because U ~ 1 commutes with 1 + t\ . Since 

= Am ' c i Er ((E p +m„c 2 ) 1 -(Ef- me 2 ) f,)‘ 

= 3 + moc2)2 ' + (£p “ "° c2)2 " “ 2 (E i ~ mic,) f 0 
= ( 2 0? + <'') 1 - 2 ( E l ~ "&*) ) 

= 2^(* ic,(, + i, ) tE ' (1 “ fl) ) ' 
and 

(1 + f,) U~ 2 = 2ni l 2p {m 2 c 4 (1 + f,) 2 + E 2 (H + f,) (1 - fo} 

_ ^ t% , - N 

— ~ — U 1 + T\ ) , 

b P 

then the current density is 


ieh \'tmr\c 2 ^ „ x , ^ x ±mc\c 

3 = -r — < —=r~ (B + ri) V3> - V0 1 — - 

2wto ( Ep Ep 


^<n + f,)*} 

i» n/~' 2 i 

-^-{^(n + fOViP- v^Un + ri)^} . 


L£L p 

For a free particle solution $60 = l/t v /vQ exp[i(p • x — E p t)/h ] we, thus, get 
for the current density 


t density 

,•(+) = 1 epc 2 
J V E p 


Exercise 1.19. 


and the current 
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Exercise 1.19. 


j(+) = y , j(+>d 3 r = . 

For a free antiparticle solution with momentum p, 

= l/y/V (®) exp[i(p * x) + Zspf] the current is then given by 



t hus, we see at once that 


J* = I d 3 r^ ±)t f 3 e^|^^ :t) 

= J , 

i.e. the current is equal to the expectation value of the operator 

d[£*] 

e — - — = ev . 


But why are the currents for particle and antiparticle with energy E p and momentum 
p equal although the charges are different? The reason is that for antiparticles with 
momentum p the velocity operator is -c 2 p/E p , i.e. velocity and momentum have 
opposite directions. The expectation value of the operator, however, has the same 
direction as p because the norm or “charge” of the wave function is included. To 
get the physical velocity we have to divide the expectation value of v by the norm 
of the state. This velocity again has opposite direction to the momentum. One says 
that the antiparticles move “backwards in time”. 


EXERCISE 


1.20 Calculation of the Position Eigenfunctions 
in the Coordinate Representation 

Problem. Carry out explicitly the calculation leading from the exact expression 
(1.179) for the eigenfunctions of position 




1 


= ! 

\mr\C 2 J 


m 0 c 2 ± E p \ explip • (x — x')/h] 


( 2 tt / i ) 3 y/4moc 2 J \ m o c 2 tE p J ^fE~ p 

to the approximation (1.180). 


d*p 


Solution. In the analysis of (1.179) we face, in particular, two integrals: 
h = / \fE p exp[ip • (x - x')/h] d 3 p , 

h = / —4= exp[ip • (x - x')/h] d 3 p 

J V e p 
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With 


p • (x — x) = \p\ \x — x\ cost/ = pr cos u , p = \p\ , 

and V being the angle between p and (x - x'), we can introduce polar coordinates. 
With the z axes in the direction of (x — x f ), the volume element is given by 
d?p — d(^d(cos v)p 2 dp, and we have 


nOQ P 1 pi'll , 

L Li </■ 


h 


iHqC 4 + p 2 c 2 ef- 1 ^ rt)pr cos d ip d(cos $)p 2 dp 


= 2tt j°° [ {fm 2 c 4 + p 2 c 2 eV h) P rcos * d cos tip 2 dp 

poo , e (i /h)pr _ -(i /h)pr 

= 2t tJ y WqC 4 4- p 2 c 2 r—fi P 2 dp 


-1- 


i pr /h 

I V m o c4 + P 2c 2 sin (pr / h)p dp 
r Jo v 

I ^Jm^c 4 -\-p 2 c 2 sin(pr/h)p dp . 


47T.fr '°° 
r 


We write 


( wqC 4 + p 2 C 2 ) 1/4 = x /^2 ^1 + ^2 


P ^ = \/ m 0 c 2 (1 +<? 2 ) 1/4 


with 


P pr , m 0 c 

Q = — , — =QZ ana z = r = /c 0 r , 

mo n fr- 


aud thus obtain 
4 tt(/T, 0 ) 3 


/i = 


\/ mQC~ I (l + <7 2 ) 1/t+ <7 sin(qz)dq = 16tt 3 /1 3 yJmoc 2 B 

Jo 


with 


5= "° 


47T 2 Z 

In an analogous way we find 


I q (1 + < 7 z ) 1/ "sin(^z)d^ 

Jo 


h = — ~ [ (frioC 4 +p 2 c 2 ) 1//4 sin(pr/h)p dp 
r Jo 


47r(moc) 3 1 


\Jmoc l 


pO O 

/ (l + « 2 )' 

Jo 


■1/4 


q sin(^)d^ 


167T 3 /l 3 


with 


\frnc 2 

A =^T Z J qO+Q 2 ) in sin(qz)dq . 


Exercise 1.20. 
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Exercise L20. 


Thus, 


, 1 1 (m 0 c 2 I 2 ±I ] 

-* (±) ' :E ' (2nhy ^4^2 \m oC y 2T I, 

1 1 f\6w 3 H\A ±B) 


Sn 3 h 3 2\\6iT 3 h 3 (A^B) 
A±B 
A^B 


The integrals A und B can be expressed by modified spherical Bessel functions 
(see Example 1.21), giving 


A = 


4t t 2 z u z 

B = *•’ d 




47t 2 z d z 

Here we have used the Basset formula 
cos(qz)dq _ z y -Jn 


r 

Jo 


{q2 + \y+'/2 />- M/2) 


Ky{z) 


These expressions for A and B can at once be verified by substitution. For large z 
the modified Bessel functions can be expanded as 




For i/ = 3/4 it follows immediately that 


* 3/4(Z \>. vf e_Z ( 1 + 3^ 


d z 


(z 3/ % /4 ( z )) 


:»1 V 2 d Z 

j/ K 


e Tz 1/4 + ^z- 3/4 + ... 


-z I _ 7 1/4 _L i,-3/4 

" ' 32" 


3? ( lV4ir v‘ ft >) .5, \/f 


A 3 

u z 

Thus we have 


(z 3/ %/4(z)) 


:»i V 2 


t-Az'i 4 -'A z -y\ 


-7 1/4 , 1^7 -3/4 

32 


fc 3 /-7r 2 1 1 


»i 4 tt 2 V 2 r(5/4)2‘ 


- ? /4 e -Z 
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For v = 1/4 we get 

K l/ 4 (z) 


7T _ z 3 1 

z»i\2z C V 32 z ’ 




^ (z^fe)) 


e-Mz-'/ 4 - 


32 


,-5/4 


+ ... 


7T 

z»i V 2 


Z>1 


_ 7 -l/4 _ jL 7 -5/4 . 
* 32 


e->(z -'/ 4 + i|z- 5/< + .. 

.,"1/4 _ 21, -5/4 j. 

" 32" 


and, therefore, 


B 


kl ITT* 


1 


1 


z»i 4? r 2 V 2 T(3/4) V 2 
with the result that 


-- le- z z“ 9/4 


yjx (±)(& ) 


'l^h/moc 8V2' 


k 3 

— e“~ 


7T 


A5/4) 

W/4)' 


.-7/4 




-7/4 ± 


r(3/4) 

1 

H3/4)' 


r 9/4 \ 


, —9/4 


EXAMPLE 


1.21 Mathematical Supplement: Modified Bessel Functions 
of the Second Type, K v (z) 


The modified Bessel functions are defined by the differential equation 


i&y , _dy 
dz : 


+ zf z -(z 2 + S)y=0 


while the ordinary Bessel functions obey 


(1) 


,d 2 y dy , 2 


2 b=o 


( 2 ) 


(1) can be transformed into (2) by the substitution z — > iz, that is the modified 
Bessel functions are identical to the ordinary Bessel functions with imaginary ar- 
guments. 

Let us denote the solutions of (2) by J^iz) and and the solutions of (1) 

by l y {z) and I- U (z). For z 0, /„, J v are the regular, and J - v , /_„ the irregular 
solutions. Then the following relation 


Liz) = e-'^Mz) 


Exercise 1.20. 


(3) 
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holds. J u {z) and I viz) are real functions of z, whereas J u (iz) is not. This is the 
reason for the special choice of the phase factor in (3). The second type of functions 
K y are defined by 


Ky(z) = 


IT I -y(z) ~ I yiz) 

2 sin utt 


The limit v — > n (n £ N) exists (though will not be proved here) and is 


(4) 


is / \ ^ <• f— viz) IAz) , 

K n (z ) = x hm ( n £ N) 

2 sin V1T 

A few examples are: 

( 2z sinz 


(5) 


J i/ 2 (z) = 
J-x/liz) = 
H/iiz) = 
J-3/2(z) = 
J 5 /2(Z) = 

J -5 lM = \l£ 


7 r z 

2 z cos z 


7 r z 

2 z ( sinz 

7 r 


cosz 


2 z 

7 r 

2z 

7T 

2z 
t r 


cos z sin z 


z 

sinz 


cosz 


3 1 \ 3 . 

—7 cos z 4- -r sin z 


z 3 z 7 ' z 2 


( 6 ) 


One may express the Bessel functions -7 n+] / 2 (z) in terms of the spherical Bessel 
functions j n (z). The relation 24 is given by 


jn(z) = \/^J n + l/2(z) • 

In analogy to (7) also the following relations hold for the LJz) and KJz): 

, , , [2z sinh z 

*\/2(Z) = 

1 -M 2 (z) = 

h/iiz) = 

/_ 3 / 2 (z) = 


( 7 ) 


7 T Z 
2z cosh 7 


sinnz cosnz 


7 T Z 

2z 

7 T \ Z A Z 

2 z ( sinhz coshz 

7 r 


+ - 


7 5/2 fe) = Vl 


Z 

+ - ] sinh z — = cosh z 
Z 5 Z z l 


24 See, e.g. G. Afken: Mathematical Methods for Physicists, 2nd ed. (Academic Press, New 
York 1970), p. 522. For a more extensive discussion of Bessel functions, see G.N. Watson: 
Theory of Bessel Functions (Cambridge University Press, Cambridge 1966). 
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— sinnz + 
z 3 



cosru 


k,/2(z) = {tz°- z (' + ?) ’ 

K5/M= \lTz e ~ l (' + l + 7 2 


Example 1,2 L 


( 8 ) 


EXAMPLE 


1.22 The Kisslinger Potential 

The calculation of bound states of pions in atomic nuclei or the scattering of pions 
at atomic nuclei are complicated many-body problems which cannot be solved 
without simplifications. However, we will now derive a kind of effective potential 
to describe the strong interaction between the pion and the nucleus. The Lagrangian 
of the nonrelativistic pion-nucleon interaction (i.e. the interaction between pion and 
nucleon describing the processes tt - -j- p — > n and tt + -j- n — > p) is 

Ant = gip n (T • V^p + h.c. = 5 (V’„cn/>p)-(V 0 ) + h.c. , (1) 

where yj n and are the neutron and the proton wave functions. The spin vector 
which is porportional to cr acts on the nucleon wave function, whereas the gradient 
acts on the pion wave function 0. This so-called Chew-Low interaction describes 
quite well the low-energy pion-nucleon experiments. The tt~ meson (described 
by the wave function 0) creates the transition of a proton into a neutron; the 
Hermitian conjugate term of (1) describes the transition of a neutron into a proton 
by a 7r + meson (described by the wave function </>*). g is the coupling constant 
for these reactions. The operator product cr • V is rotationally invariant but not 
parity invariant. The parity invariance of the whole interaction is ensured by the 
pseudoscalar <j>(r). 

In the following we discuss the problem of how to describe the interaction 
between one pion and many bound nucleons, starting with this free pion-nucleon 
interaction (1). The simplest approximation is to describe the interaction between 
the pion and nucleon by the free interaction between them. This is called the 
impulse approximation and is reasonable for pions interacting with only one of the 
nucleons inside the nucleus, i.e. the other nucleons just appear in kinematic factors 
(e.g. altered energy-momentum balance of the nucleon). Since the ?r-N interaction 
is of short range (~ 1 fm), and the mean distance A between the nucleons is larger 
than 1 fm, the impulse approximation should be applicable at least for low energies. 
To estimate the potential energy we assume that, on its way through the nucleus, 
the pion has several interactions in series, but that every interaction is only between 
the pion and one “free” nucleon. The model based on these assumptions is called 
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the multiple scattering model. 25 We shall now study the multiple scattering of pion 
and nucleus according to this model. The incoming state |in) and the the outgoing 
state | out) of the pion-nucleus system are related by the S matrix S via 

| out) = S\in) . (2) 

It is convenient to introduce a matrix R = S - H, i.e. the case for no interaction 
(| out) = | in)) is subtracted. We consider the scattering of a particle with momentum 
p —> p' . Then, because of energy conservation, we have 

(p'\R\p) = 8 (E p , - E p ) t {p ^ p 1 ) , (3a) 

where 

t (p -> p') = lim (p'\f(E p + ie)|p) . (3b) 

The T matrix (transition matrix) has a complex argument z = E p -\-\c. To understand 
the intention behind this for the present, purely mathematical, manipulation, we 
consider the theory of Hermitian operators. 

Let Hq = p^/2mo and H — H 0 4- V . The corresponding Green's operators or 
resolvents are defined by 

G 0 (z) = {z - Ho)~ l , 

G{z) = (z — H)~ X , (4a) 

provided the inverse exists. The name is easily understood since 

(z-# 0 )6b(z) = l • (4b) 

Taking the matrix elements of H 0 in coordinate space, we have, since H 0 — 
— ^ 2 V 2 /2m 0 , 

- h 2 V 2 

(x\Ho\ip) = (*l^> • (5a) 

Equation (4a) yields 

(*1 (z - Ho) Go(z) \x') = (*111®') (5b) 

and, after insertion of a complete set \x"). 

J dV' <*| (z - H 0 ) |x") (x" \Go{z)\x') = 8 (3 \x - *') 
or 

f d 3 x" (z -Ho(x)) (*1*") (x''\Go(z)\x') 

= (z- H 0 (x)} <*|6 0 (z)|®') = 6°\x - *') . 


25 K.M. Watson: Phys. Rev. 89, 575 (1953). 
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Since z is a complex number, (5b) results, using (5a) (with Ub) = \x}), in 
(z - Ho) (*|Go(z)|*') = 6 (2 \x - x') 
and, thus, 

( + ( x \^(z)\x') = 5 0) (x - x') . 

In conventional notation (6a) reads 

( h 2 v 2 


\ 2 m 0 


+ z 1 Go(z ; x, x 1 ) = (5 (3) (a: - x') 


(6a) 


(6b) 


Thus Go is indeed a Green’s function, i.e. the solution of a differential equation 
where the inhomogeneity is a point-like source ( 6 function). From the definition 
(4a) we see that Go and G do not exist for all values of z : If, say, E n is an eigenvalue 
of H , i.e. (E n — H)\n) = 0, then G does not exist for z = E n \ Obviously, the 
poles of G signal the eigenvalues of the system. Addition of an imaginary term 
to E , i.e. z = E -f ie defines the movement of the integration around the pole. 26 
In particular it may be shown how, thereby, perturbations (waves) propagate into 
the future, thus ensuring causality. If H has a purely discrete spectrum, E n are the 
eigenenergies and { |/i ) } the orthonormal basis of eigenvectors, then, because of 
H = \n){n\, we obtain 


G(z) = (z -//r 1 



l n )( n l 

— E — E n + ie 


( 7 ) 


Example 1.22. 



Illustration of a typical distri- 
bution of the eigenvalues of 
a Hamiltonian with discrete 
and continuous spectrum 


If, in addition to the discrete spectrum, H also has a continuous spectrum, then a 
branch cut appears in the z plane from E = 0 to E = oo, because every E > 0 is 
an eigenvalue. The branch cut runs along the whole z axis (see above figure). The 
operator identity 


A~ l = B~ x +B~\B -A)A~ 1 , 

(8) 

where A = z — H and B = z — Ho> yields 


G(z) = Gq(z) + Gq(z)V G(z) 

( 9 ) 


which relates G and Gq. 


26 A discussion of this subject may be found in W. Greiner, J. Reinhardt: Quantum Electro- 
dynamics, , 2nd ed. (Springer, Berlin, Heidelberg 1994), Chap. 2. 
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If A and B are exchanged, the following relation can also be derived: 

G(z) = G 0 (z) + G(z)VG 0 (z) . (10) 

Equations (9) and (10) are called the Lippmann— Schwinger equations. 

Next we define the operator 

f(z) = V + VG(z)V . ( 11 ) 

Multiplication with Go from the lhs yields 

Got = (Go + GoVG) V (12) 

or, because of (9), 

G 0 (z)T(z) = G(z)V . (13) 

Inserting this result into (11), one arrives at the Lippmann-Schwinger equation for 
T(z), namely, 

f(z) = V 4- VGoiz)f(z) . (14a) 


For sufficiently small values of V this equation can be solved iteratively, starting 
with the so-called Bom approximation t « V . Substituting this approximation into 
(14a) we obtain f « V 4- V G 0 V . Continuation of this procedure yields the infinite 
Born series given by 

T = V + VG 0 V 4- VGoVGqV + . . . . (14b) 


We should bear in mind that this series does not necessarily converge for all V ! 
However, further considerations of specific examples are based on the assumption 
of proper convergence of the Born series (14b), with which we have a method to 
handle the pion-nucleus problem: Let the Hamiltonian be of the form 

A 

H = fir + 4uc + J2 v * = Ho + V . (15) 

a=l 

Here T n is the free-pion Hamiltonian and H nuc describes the nucleus, which we 
assume to be known. The Lippmann-Schwinger equation for T is now given by 
(14b), using (4) for Gq(E ), so that 

Go = (E — Hq -f- it) -1 , e — > 0 + . (16) 

As we have introduced T as a scattering operator, we can write the multiple 
scattering solution (14b) as follows 


E 


t' + 


E' 

Oi\ , OL2 


t'oLpoi'c 


Oil 


t ai G 0 t' a2 G 0 t' ai + ... 

OL \ .OL2 jOti, 


(17) 


The first term corresponds to the scattering at a single nucleon (summed over 
all nucleons), the second term describes the scattering at two nucleons, etc. The 
notation Y] f means that oq, a?> < 22 , etc., so that we exclude multiple 

scattering at the same nucleon. These terms are already contained in the definition 
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of the V matrix for the scattering of a pi on at a bound nucleon, which, according 
to (14), reads 

t'a = v a + v a G 0 t' a . (18) 

t f a contains the total nuclear Hamiltonian through G 0 , and the free pion-nucleon 
amplitude t a obeys an equivalent equation, in which H nuc in Go is replaced by 
a free one-nucleon Hamiltonian. Thus, V a is a complicated many-body operator, 
whereas t a is noti The elastic scattering is given by 

TVi - (0\t |0) . (19) 

Here |0) represents the ground state of the nucleus. In (19) the integration over the 
coordinates of the nucleus has to be performed. Therefore, T e \ depends only on the 
coordinates of the pion. Analogously to (14), we define the optical potential of the 
pion as 


T e \ — V-n 4- Vjy Gq Tq\ 


( 20 ) 


for which, according to (17), 

T d = 53 ( ? “)el + 53 + • • • ( 21 ) 

Ot OC] ,OC 2 

follows. Consequently, the pion wave function obeys the one-particle equation 

(tr 4- V n ) 0 = £*0 = E(p . (22) 

In principle this optical potential always exists, but in general it is energy dependent 
and nonlocal, as we will see in the following. 

In order to proceed without calculation, we have to make a number of approx- 
imations, which we shall now discuss: 

(1) The Impulse Approximation, According to our previous considerations, the 
scattering inside the nucleus is the same as free scattering and, therefore, 

r a = i a , (23) 

i.e. the complicated many-particle operator for the bound nucleon will be replaced 
by the one-particle operator t a . 

(2) The Approximation of an Uncorrelated Nucleus. Nucleons are fermions. 
Thus, the wave function of the nucleus has to be antisymmetrical with respect to 
the coordinates of the nucleons. Instead, we use here products of the one particle 
wave functions which are not antisymmetrized (i.e. uncorrelated). For different 
nucleons (oq ^ ai) this means that the nucleon, which is excited from the ground 
state |0) to the state | n) by i a2 and then propagates freely through the action of 
G 0 , cannot be scattered back from the state |n) into the ground state by i.e. 

(OIL.) Gq (w|? Q2 |0) = 0 . 


Example 1.22 . 
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Only for | n) = |0) (no scattering) is this equation not valid. At the same time this 
means that, for an elastic final amplitude, all the intermediate processes must be 
elastic too, so that the double scattering term in ( 21 ) becomes 


X ( ? «i^ 0 ?a 2 ) e] 

&li=<X2 


/ ^ ~ql\ c\^G^lx .2 el 
Oil i=OC2 


(24) 


(3) Assumption of Coherence. This assumption implies that the nucleus remains 
in the ground state the whole time during the scattering process, or, in other words, 
intermediate states of the excited nucleus do not exist; the assumption of an un- 
correlated nucleus (24) implies for at f a 2 f . . . 


(7 + \ 


(25) 


In this sense coherence means that, though the pion scatters from nucleon 1 
at the space point x x , the probability of colliding again with the same nucleon at 
x\ is still given by the ground-state wave function. However, the slowly moving 
nucleon will stay close to x\. The probability of colliding with the nucleon twice 
without exciting the nucleus is very small, because the consequence would be a 
large momentum transfer to the nucleon in the intermediate state. Nevertheless, we 
also assume that for repeated scattering at the same nucleon (a } , a 2 , a' 3 . . ., not all 
OLi are different) (25) is still valid, and 


(t ai G 0 t ai G 0 t ai ) e] « t, M e \G 0 t a2 dGot ot 1 el (26) 

According to previous statements, this assumption must be used with care, 
because one would actually expect that the backward scattering terms (connected 
with large momentum transfer) are negligible. With these three assumptions, (17) 
reads as 


Tz 1 = 


E> 

a 


■«'+ E ta.\ el Q) ^*2 

Oil ,OL2 


el + X 


toti elGo?a 2 elGo el + 


• (27) 


Oil,OL2, «3 


(4) Assumption of a Heavy Nucleus. If the number A of nucleons is very large, 
we abandon the restrictions in the sums (27) i.e. we allow a/ = a* (coherence). 
Furthermore, the number of the excited states is then very large, since the sums 
over a/ can separately be performed, with the result 

%\ = G 0 ( ^ e lj 

+ ^ el ) (E'-OMEM*- 

X lj + Gof e i . (28) 

Comparing the result with the definition of V n in (20), T e \ = V n + ^ G 0 7 ei. it 
follows that 

V^X^ei^OlX^! 0 ) > (29) 

because the elastic amplitude of the ground-state expectation value is T e \ = (0|f|0). 
The expression (29) is called the optical potential of the lowest order , and is the 
first term of an infinite series for V n . 
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(5) Absence of Recoil. The final approximation is to take the position of the 
nucleon after scattering a pion as unchanged (no recoil), i.e. if r, r' are the pion 
coordinates and R \ , R[ the coordinates of the first nucleon, we assume that 


= (r' - R,\h\r - Rr) 6^ {R[ - R,) 

= ‘ 0> W -*’>// 00 

x e ik ' - (r ~ Ri) (k’\h\k) e-' lk - (r ~ Rl) 


(30) 


Here ( k'\t\k ) is the Fourier amplitude of the scattering of a pion at a free 
nucleon. From this we can now derive the optical potential of the lowest order, 
and, for that purpose, we integrate over R\ and R[ to obtain 


(r'lUr) 

= H d 3 /?! d 3 RW(R[)(r',R[\h\r,Ri)^ (Ri 


II 


d 3 k d 3 k' 
(2tt) 3 (27 r) 3 


a +i k' • r' /uf 


■ r (k%\k) 


x e 


— ifc • r 


J d 3 /?,|Vh(fli)| 


2 e +i(fc-fc / )-Hi 


(31) 


Summing over all nucleons for the calculation of 


^ (r-'|? Q ei|r-) = (r'| ^r Qe i|0 , 

a a 

^2 f l^a(«a)| 2 e +i(fc - fc,) -^d 3 f? a 

a = 1 J 

— A J g(R)e +i(k - k,) - R d 3 R = Ag(k - k') . (32) 

Here p(R) is the nuclear density distribution per nucleon (therefore, the factor A 
for the number of nucleons), and g(k — k l ) its Fourier transform (form factor). 
Thereby, results the generally nonlocal potential according to (29), 

V^(r,r') = (r'\V n \r) = (r'\ ^ ?o. e i|r) 

a 

= //00' +,r ' r, < k '' v '' fc > e - i ‘' r ■ < 33) 

Conversely, in momentum representation it clearly follows that 

(k f \V n \k) - Ag(k - fc / )(fe / |?|fe) . (34) 

For a better understanding of this result, first consider the simplest case: Let 
the incoming pion wave function be a plane wave with the momentum fc 0 - Then 
the total wave function within the nucleus is a wave packet, centred at .fco, which 


Example 1.22. 
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we call xjj^yk). The form factor of the nucleus Ag(k - k r ) has an extension of 
~ 1 / /^nucleus m momentum space, while the scattering amplitude for scattering at 
one nucleon is (k f \i\k) « l//?nucieon- By multiplication of ^(fc) with g(k - k f ), 
whose width is given by l//?„ucieus < 1 /^nucleon, only the momenta k ~ k* ~ k 0 
contribute to the scattering amplitude, because the pion wave function ^ 0 (fc) is 
centred at k « ko . Therefore, in (34) we can put 

{k'lV^k) ^ A{k 0 \i\k 0 )g(k - k f ) . (35a) 


According to (33), in coordinate space the Fourier-transformed expression follows 
as 


V w - A(k 0 \t\k 0 ) IJ 
= A(ko\i\k 0 ) jj 


d 3 k d 3 k' 
( 27 r ) 3 (2tt) 3 
d 3 k d 3 k' 
(27r) 3 (27 r) 3 
3 k' 


e +ik ' -r' e~ ik -r g(k - k') 


a +i k! *(r — r') — i(fc — fc')-r 


■ r g(k-k') 


= A(fc 0 |?|fco)<5(r - r')g(r) . 


d 3 K 

(27r) 3 


e~ iK - r g(K ) 


(35b) 


The nonlocal potential acts on the pion wave function ip n (r') in coordinate space 
so that, e.g. the stationary Schrodinger equation for nonrelativistic pions reads 


2 m 0 


V 2 ip„(r) 4- 


/ 


d 3 r'V n (r, r'W-xir') = Eip n (r) , 


(36a) 


or the Klein-Ciordon equation, 


(&d 2 _ 

\c 2 dt 2 


- h 2 V 2 + m 0 2 c 2 


ip(r, t) + m 0 


/ dVv > 


(r, r f )\l)(r r , 0 = 0 


. (36b) 


Clearly, the simple approximation (35a) has the effect that the generally non- 
local potential of the pion-nucleus interaction is approximated by a local one. 
The 6 function in (35b) naturally ensures this. Furthermore, the 5(r — r f ) function 
disappears by integrating over r', which has to be done in (36a, b). The result is a 
“common” Schrodinger or Klein-Gordon equation, with a local potential that reads 

Vn(r) = A(ko\t\k 0 )g(r) . (37) 


me resulting potential is local, and proportional to the so-called pion-nucleon 
forward scattering amplitude 

a 0 = (fc 0 |?|fco> • (38a) 


However, the potential V^(r) = Aa 0 g(r) gives an insufficient description of the 
pion-nucleus scattering experiments and it is a small step to assume, analogously 
to (38a), 27 that 

(k f \t\k) — ao 4- a\(k' • k ) . 

27 L.S. Kisslinger: Phys. Rev. 98, 761 (1955). 


(38b) 
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This is the next best generalization to (38a), keeping rotational invariance. Insertion 
into (34) results (after Fourier transformation into coordinate space, see Exercise 
1.23) in 

V%i/j = AaoQi)j - Aa\ V • (QVrp) , (39) 

which represents the so-called Kisslinger potential. It is obviously nonlocal be- 
cause it contains momentum operators which are proportional to V. In view of 
a term proportional to one may deduce a strong surface sensitivity in pi on- 
nucleon scattering, since the gradient of the nuclear density contributes mainly at 
the surface of the nucleus. By means of the Kisslinger potential most of the data 
of pion-nucleus scattering are quite well described, although further evaluation is 
still plagued with certain difficulties (divergencies in the wave functions). 


EXERCISE 


1.23 Evaluation of the Kisslinger Potential in Coordinate Space 

Problem. Evaluate the Kisslinger potential in coordinate space, starting with 
(k , \V 7 ,\k) = Ag(k - k')(k'\t\k) , 

together with (see Example 1.22) 

(k f \i\k) = do + d\(k r • k) . 

Solution. In coordinate space one has 

= A J(a Q + a x k' • k)g(z a )ipAz')e ,k ' ' (z ~ Za) 


x e 


-ik '(z'-Zat) 


d 3 k d 3 k' 

(2tt) 3 (2tt) 3 


dVd 3 z a , 


where z a denote the space vectors of a single nucleon [see (31) and (32) of Ex- 
ample 1.22]. Evaluating the first term oq leads to (37) of the previous example. 
Considering the second term we reexpress the momenta k, k' hv k — i V-- , and 
k' = — iVj, where the gradients act on z and z', respectively: 


V 


Vtt = Aa\ j Q(z a )ipAz')'V z e ,k ' <2 ~ z “ ) 

H 3 jC , 

jd'z'd^ 


A$lr 

fc-(z'- 2 c). u ~ ,v aS-IaS* 


{In ) 3 (2tt) 3 

Since there is no integration over z, we can take V z out of the integrand and 
substitute V z > = — V Za , 

v'<p, =Aa,V, ■ I 


(-VJ 


3 — i k • (z' — z a ) 


d 3 k d 3 k f 

(2tt) 3 (2tt) 3 


d 3 z f d 3 z a 


Example 1.22. 
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Evaluating the integral over k\ 

V'^=Aa x V z • [ Q(z a )i!>Az')(-mz -z a )(-V Za ) 

xe-' Mz '- z » ) ^dVd 3 Za 
(27 r) 3 

= A ai V z - I e(z)^(z , X+V ^ )e- i *■(*'-*^d 3 z , 

= A«r v z • ^(z) J ^(z')(+V z ) Q - ik ^'-^-~E d 3 z') . 

Next, performing the integration over k, one obtains 

V'fa =Aa<V z • (e(z) j ^(z'){+V z )8(z - z')dV) , 
and, after partial integration, 

V'^ir = a\AV z • ^p(z)j + j ipn{z')8{z - z ') dV 

surface 

- f {V^(z'))8{z-z')&\'\) . 

The surface term vanishes if the surface tends to infinity, and we are left with 
V'lpir = ~a\AV • (pVV'tt) , 
i. e. altogether 

V\p v = Aa 0 Q(r)xp n (r) - Aa\V -(p(r)V^(r)) . 


EXAMPLE 


1.24 Lorentz-Lorenz Effect in Electrodynamics and Its Analogy 
in Pion-Nucleon Scattering (the Ericson-Ericson Correction) 

In order to understand our motivation and the physical nature here, let us first 
recall the electrodynamical Lorentz-Lorenz effect. For this reason we consider the 
propagation of light in a dielectric medium. The electric polarization P is related 
to the electric field E via E = aP, a being the polarizibility. Accordingly, the 
D field is given by D = E + 47 tP - (1 + 4* /a)E. Considering the propagation 
along the z axis only, the fields E, D and H are proportional to expfifgz — ut)]. 
The fields E and D have only the components E x and D x , i. e. the magnetic field 
consists only of the component H y . Maxwell’s equations 


1 dD 

- — — = V x H and 
c dt 


( 1 ) 
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V x E = - 


1 OH 

c dt 


imply 


* D X = \(fHy , 

c 

\qE x = i -H x , 
c 


and elimination of the component H y produces 



( 2 ) 

(3) 


Using the relation D x = (1 + An /a)E x one thus obtains the refractive index as 


2 c 2 An 

n — — = e = l -\ 

v“ a 


(4) 


Also the phase velocity in a medium v = u/q is obtained from (3). In order to 
determine a we must have a closer look at the details of the scattering process. 
We choose electrons as the centres of the scattering process, assuming that they 
are harmonically bound by means of a restoring force/. The classical equation of 
motion of an electron moving in the z direction thus reads 


d 2 ^ 

m dTi = -n 


(5) 


where £ denotes the elongation of the electron from its equilibrium position. The 
frequency of this harmonic oscillator is then given by ur R =f/mQ , and the electric 
dipole moment per electron, p x = leads to a moment per unit volume of 


Px = eg£ 


( 6 ) 


where g stands for the electric charge density of the electron. Under the influence 
of the external oscillating field E = aP , the electron performs forced oscillations, 
i. e. the total force acting on the electron is not given by the external field E alone, 
but by the resulting local field 28 E + (An/3)P. This local field is the origin of the 
Lorentz-Lorenz correction! 

To make this more evident let us assume that the oscillating electron be 
surrounded by a small sphere (see figure on the right). The surface cuts some 
polarization vectors, which causes a charge per surface element dcr. The sur- 
face charge can be written as g a = P * dcr, since V * D = 0 is valid if 
real external charges are absent. One can conclude, in accord with GauG’s law, 
that the surface charges on the inner and outer side of the sphere are equal, 
i. e. ( E • dS'Xnside = [(E + AnP) • d^out^e. Averaging over all directions 
Pens 2 9 = P / 3 leads to (^^nside = ( £^)outside + (47 t/ 3)P* in the x direction. 
Accordingly, the equation of motion of the electron becomes 


m 0 d 2 ^/dt 2 = e(E z +(4tt/3)/>,)/£ , 


28 See J.D. Jackson: Classical Electrodynamics , 2nd ed. (Wiley, New York 1975). 
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Illustration of the polariza- 
tion in the surrounding of an 
electron under consideration 
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Nucleon surrounded by a 
sphere free of nuclear matter 


which can be written in a more convenient form using the relation P x = eg£ [see 

( 6 )]: 


m 0 d 2 P X 
ge 2 df 2 


= E X + ^- 7 tP x - -Ep x 
3 ge 1 


The requirement that the electrons should oscillate harmonically, i. e. 

P x -\-coP x = 0 , 


(7) 


leads to the condition 


E x 


ge 2 


mpco 3 

ge 3 


4tt 

T 


P x — OtP x 


which determines the polarizability a as 


_ ( _f mu 2 _ 47 t \ _ ( m 0 ui _ mpu 2 _ 

\ge 2 ge 2 3 J \ ge 2 ge 2 

For the harmonic case the frequency is given by 
1 + 47r/a we, further, obtain 


47T 

T 


( 8 ) 


= f/mp. Considering n 2 = 


A'nge 2 /mp 

(w 2 - u > 2 ) - (47T/3 )ge 2 /m 0 


(9) 


In the above derivation the radiative damping, which leads to a complex refractive 
index, has been neglected. If polarization corrections [2nd terms in the denominator 
of (9)] are ignored, the refractive index diverges when the frequency of the external 
field reaches the resonance frequency (resonance catastrophe), i. e. n 2 — > oo for 
u — > ojr. Thus the Lorentz-Lorenz correction prevents this catastrophe for t u = cu R . 

Relation (9) is of the form used in optics, 

n 2 - 1 = , ( 10 ) 

K 0 


where f(k , lj) represents the forward-scattering amplitude of particles with mo- 
mentum k and frequency uj propagating in a medium which is characterized by 
a density of scattering centers g. The momentum of the incoming particle (light) 
is denoted by kp. This formulation suggests, in analogy to light scattering inside 
a medium with density g, the consideration of the scattering of a pion (initial 
momentum kp) at a nucleus with density g by means of optical methods. 

In analogy to the Lorentz-Lorenz correction —Ange 2 /2ml (9), let us now 
derive the higher-order correction in g of the “optical” potential of a pion. 29 For 
this purpose we assume that a nucleon - as a scatterer - pushes off all other 
nucleons in its vicinity, with the effect that, in a sphere with radius R enclosing 
the particular nucleon, there is no other nuclear matter in it (see figure on the left). 
In what follows we have to show, as in the electrostatic case, that the result is 
independent of R. Therefore, we first rewrite the Klein-Gordon equation for the 
pion containing the Kisslinger potential ( d/dt — > i u). 


29 G.E. Brown, W. Weise: Phys. Rep. 22, 279 (1975). 
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(a; 2 — + V 2 — a \ V * pV) (f)^ — 0 (a > 0) , (1 1) Example 1.24. 

where a\ = Acq according to (39) of example (1.22). The first, less important 
term of the Kisslinger potential (39) has been neglected altogether. This can be 
expressed as 

V • (1 -aig)V4> n = - (lj 2 - ml) 0*. . (12) 


The field 6 „ must be continuous on each boundary. Thus, we can conclude that 
the rhs of (12) must be continuous at a boundary. In analogy to the continuity of 
D in the electromagnetic case, the normal component of 

(1 -a xQ )V6^ (13) 


© 

Ah 



must also be continuous. To make this explicit we enclose each boundary by a 
(cylindrical) volume, as in the figure on the right. In the limiting case of the 
vanishing height of the cylinder, i. e. Ah — > 0, 

/v.(l-^)V^dr = (l- 2 r 1 ,)/vA r .dF^0 • (14) 

r 

Since the rhs of (12) is continuous at the boundary, the expression (1 —a\g)^(/> n • 
d F has to be equal on both sides of the boundary. There exists a direct analogy 
between 


© 

Illustration of the cylinder 
with faces dF and height 
Ah surrounding the bound- 
ary. The different media at 
both sides are indicated by 
(1) and (2) respectively 


(1 - a\g)V<l) 7r — > D and 

Vfa-tE . (15) 

In order to pursue this further, we discuss the Chew-Low model, which is 
based on the Lagrangian [see Example 1.22, (1)] 

Cim = gtptr ‘ ■ 0 6 ) 

For the pion it leads to the equation of motion 

^ + V 2 - = gV • cr(r) (17) 

where 

cr(r) = ihfT'ibir) , (h = c = 1) . 

Accordingly, the pion field couples to the spin density cr(r), and the ^ field should 
be stationary, i. e. 

t) = </) ff (r)e _IW ' . (18) 

We further assume that u ~ m 7T , i. e. we study the case where nuclear effects may 
represent only a small correction to the binding. This is realized in pionic atoms; 
thus, 
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Example 1.24. 


Inserted into (17) this leads to the Laplace equation 

= g'V • cr(r) . (17a) 

As known from electrostatics, 30 the Laplace equation (17a) can be solved by 
means of the Green function with the result 

= -^- / ^_L__ v r , . (19) 

v r / is the gradient acting on r'. This shows explicitly that (g/ 47t)V • a(r') plays 
the role of a charge density as in the electrostatic case. Analogously to the electric 
field E , (14) yields 

(V</> 7 r — a i exterior ' = ( ^ 07 r interior * d F , (20) 

which implies a surface charge 


Qs — Cl 1 q( V0 7r j_) ex terior 


( 21 ) 


V07r_L represents the component of perpendicular to the boundary and gives 
rise to a contribution to V<f> n (r G surface), which we denote by (V</> 7r ) iocai , 

^(^0 7r)local = {^0 “interior ~ ^ ^exterior ) — ‘ — 1 ^^^ >7r ^ exterior ’ (22) 

where the averaging ( ) over all angles, i. e. over the small sphere around the 
nucleon, has already been carried out. This term corresponds to the (47 t/3 )P term 
in electrostatics. Equation (22) is correct to the lowest order in a\o. However, note 
that if (V0 7r )iocai has to be corrected by the term (22), then the same has to be 
done for the Kisslinger term -aig'Vcp^, i. e. we must write 

^(V(/%)iocal — ~ j a 1 £?(^07r)local • (23) 

The influence of the nucleon on the optical potential of the pion happens locally 
at the nucleon. From (23), 


(^0tt) local — (V&r) —a i ^(V^) 7r )iocaI 


(24) 


and thus 


(V^) l0 cal = , (V _^ • (25) 

l+aip/3 v 7 

Here we have renamed (V^^extenor = V^, because this is just the pion 
field inside a medium. Furthermore the corrections of the correction, etc. have 
been summed up in terms of a geometrical series. The expression (25) repre- 
sents the Ericson-Ericson correction 31 to the gradient of the pion field. Replacing 
(V^Kissiinger by (V0 7r )i oca i in the Kisslinger potential leads to 

30 See J.D. Jackson: Classical Electrodynamics , 2nd ed. (Wiley, New York i975) or 
W. Greiner: Classical Electrodynamics (Springer, New York 1998), Chap. 1. 

M. Ericson, T.E.O. Ericson: Ann. of Physics 36, 323 (1966). 
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V^- E - h Mr, t)=A (a*Q - V • { 1 + ^ / 3 v}) Mr, 0 (26) 

as the new optical potential for pions inside a nucleus; and thereby, ~a\ = Aa\ is 
proportional to a\. A more detailed analysis shows that the potential (26) is valid 
only in the limiting case k — > 0 , because in the above derivation we restricted u 
of u ~ rrin- The dependence of u and k on ao is quite complicated, and for this 
reason a i is fitted to experimental data. Until now no unique fit has been found, 32 
because the corresponding experimental effects represent very small corrections. 


Example 1.24. 


1.13 Biographical Notes 


KLEIN, Oskar Benjamin, Swedish physicist, * 15.9.1894, f 1984. Professor for Theo- 
retical Physics at the University Stockholm from 1931 to 1962. Important achievements 
apart from the formulation of the Klein-Gordon equation: Klein’s paradox (see Chap. 13), 
Klein-Nishina formula, Kaluza-Klein theory. In 1960 he was awarded with the Max-Planck 
Medaille of the Deutsche Physikalische Gesellschaft (German Physical Society). 

GORDON, Walter, *3.8.1893 in Apolda (Thuringen, Germany), 1 24.12.1939 in Stock- 
holm, formulated in 1926 independently from O. Klein a relativistic wave equation for free 
particle without spin (Klein-Gordon equation) and derived in 1928 simultaneously with 
C. G. Darwin the fine structure of the hydrogen spectrum from Dirac’s wave equation [BR]. 

LAGRANGE, Joseph Louis, *25.01.1736 in Torino, f 10.04.1813 in Paris. L. originated 
from a French-Italian family, and in 1755 became professor in Torino. In the year 1766 he 
went to Berlin as the director of the mathematical-physics department. In 1786, after the 
death of Frederick II, he went to Paris, where he gave considerable support to the reform of 
the measuring system, and where he was professor at several universities. His very exten- 
sive work contains a new foundation of variational calculus (1760) and its application on 
dynamics, contributions to the three-body problem (1772), the application of the theory of 
chain fractions on the solution of equations (1767), number-theoretical problems, and an un- 
successful reduction of infinitesimal calculus on algebra. With his “Mecanique Anaiytique” 
(1788), L. became the initiator of analytic mechanics. Important for function theory is his 
“Theorie des Fonctions Analytiques, Contenant les Principes du Calcul Differentiel” (1789), 
and for algebra his “Traite de la Resolution des Equations Numeriques de tous Degres” 
(1798). 

SCHRODINGER, Erwin, *08.12.1887 in Vienna, f 01 .04. 1961 ibidem, professor in 
Zurich, Berlin, Oxford, Graz, Dublin and Vienna; worked in theoretical physics, especially 
on quantum theory. Based on L. de Broglie’s idea of matter waves. S. developed the wave 
mechanics in 1926, established the wave equation named after him and proved the equiva- 
lence between the latter and the matrix mechanics of Heisenberg. With these principles S. 
created a homogenous base of quantum and atomic theory. In fact, S. found first the rela- 
tivistic wave equation known today as Klein-Gordon equation, but he dismissed it. Later S. 
worked on problems of relativistic quantum theory, gravitation theory and on a new field 
theory. He also was concerned with philosophical questions. In 1933 he received the Nobel 
Prize in physics, together with P. A. M. Dirac. 


32 E. Friedman, A. Gal, V.B. Mandelzweig: Phys. Rev. Lett. 41, 794 (1978). 
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FESKBACH, Herman, *02.02.1917 in New York, from 1955 university professor at the 
Massachusetts Institute of Technology (MIT) in Boston, from 1973 head of the MIT physics 
department. He is one of the most distinguished theoretical nuclear scientists of his times. 
Among other things he is distinguished by his works in the held of nuclear reactions. 

VILLARS, Felix Marc, *06.01.1921 in Biel (Switzerland), student of W. Pauli, university 
professor at the Massachusetts Institute of Technology (MIT) in Boston. Among others he 
is distinguished by his work on the microscopic structure of collective motions in nuclei. 
The Pauli-Villars regularization procedure is discussed in W. Greiner, J. Reinhardt: Quantum 
Electrodynamics (Springer, Berlin, Heidelberg 1994). 

SCHIFF, Leonard Isaac, *29.03.1915 in Fall River (MA), j 19.01.1971 in Stanford. From 
1941 at the University of Pennsylvania, becoming associate professor in 1944. In 1947 he 
moved to the Physics Department, Stanford (where he was Head of Physics from 1948 
to 1966). Main heids: scattering theory, general relativity. A main part of his academic 
work was the education of young students. His excellent book on quantum mechanics made 
him world famous. In 1966 he received the Oerstedt-medal of the American Association of 
Physics Teachers. 

WEINBERG, Steven, *03.05.1933 in New York, today at the University of Texas, before 
this at Havard University. W. worked on cosmology and together with S.L. Glashow and 
A. Salam propounded the unified theory of weak and electromagnetic interaction (“standard 
model”), which, after this theory, only appear as different forms of one single force. For 
this they were awarded the Nobel Prize in physics in 1979. 

HEISENBERG, Werner Karl, *05.12.1901 in Wurzburg, f 01.02.1976 in Munchen, from 
1927 to 1941 professor at Leipzig University, from 1941-45 director of the Kaiser- Wilhelm 
Institut fur Physik in Berlin, 1946-57 director of the newly founded Max-rianck-Institut fur 
Physik in Gottingen and 1958-70 in Munchen. H. demanded in 1925, that only “principally 
measurable quantities” should be used to describe atomic phenomena (positivistic princi- 
ple). With his “multiplication rules for quadratic schemes” he initiated the new “Gottinger 
Matrix Mechanics”, which was further established in 1925 by M. Born, P. Jordan and H. 
In collaboration with Niels Bohr he succeeded in explaining the physical and philosophical 
background of the new formalism. Heisenberg’s uncertainty relation (1927) became the basis 
of the “Kopenhagen interpretation” of quantum theory. After the discovery of the neutron 
by J. Chadwick in 1932 Heisenberg recognized that this particle is, in addition to the pro- 
ton, an element of the nucleus, and he developed a theory of nuclear structure (concept of 
iso-spin). From 1953 Heisenberg worked on a unified theory of matter (“world formula”), 
which should be able to describe all elementary particles occuring in nature and all natural 
laws. In 1932 he received the Nobel Prize in physics and 1957 the peace class of the order 
“Pour le merite” [BR]. 



2. A Wave Equation for Spin-| Particles: 
The Dirac Equation 


We follow the historical approach of Dirac who, in 1928, searched for a relativistic 
covariant wave equation of the Schrodinger form 

. dxb 

ih-~=Hip (2.1) 

with positive definite probability density. At that time there were doubts concern- 
ing the Klein-Gordon equation, which did not yield such probability density [see 
(1.29)]. The charge density interpretation was not known at that time and would 
have made little physical sense, because 7r + and ir~ mesons as charged spin-0 
particles had not yet been discovered. 

Since an equation in the form (2.1) is linear in the time derivative, it is natural to 
try to construct a Hamiltonian that is also linear in the spatial derivatives (equality 
of spatial and temporal coordinates). Hence, the desired equation (2.1) has to be 
of the form 


i h 


dip 

dt 


he / d A d ^ d 

TT'a? + “ 2 a^ + “ J a? 


+ /3moc 2 


ip = Hftp 


( 2 . 2 ) 


The - yet unknown - coefficients a, cannot be simple numbers, otherwise (2.2) 
would not be form invariant with respect to simple spatial rotations. We suspect 
that the a, are matrices and indicate this by the operator sign A. Then ip cannot 
be a simple scalar, but has to be a column vector 


/ ip\(x, t) \ 

I 1p2(X,t) | 


(2.3) 


\ip N (x,t)J 

from which a positive definite density of the form 


Q (x) = ^ip(x) = (r l ,r 2 ,...,r N ) 


/ V»i \ 

1p2 

\lp N / 


N 


= ^1p*1pi(x) 


i = 1 


(2.4) 


can be constructed immediately. We still have to show that g(x) is the temporal 
component of a four-vector (current) for which a continuity equation must exist 
so that the spatial integral f gd 3 x becomes constant in time. Only then is the 
probability interpretation of gx ensured. It is clear that the wave function ip in (2.3) 
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is a column vector analogous to the spin wave functions of the Pauli equation. 1 
Hence, we shall call them spinors, specifying this name later. The dimension N of 
the spinor is not yet known, but we will be able to decide this soon. The coefficients 
at and P must obviously be quadratic N x N (2.2). Thus the Schrodinger-like 
equation (2.1) and (2.3) represents a system of N coupled first-order differential 
equations of the spinor components t/>,, i = 1,2, ... ,N. We also indicate this point 
in the notation and write (2.2) in the form 


i h 


dip a he 
dt 


N 


= tE 


T — 1 


«1 7TT + &2 


d 


dx x dx 2 


d \ N 

+ «3 dx*) ^ + m oC' 2 ^/3 CTT i/v 

X ' OT T = 1 


N 




(2.5) 


T— 1 


Equation (2.2) is a short form of (2.5), in which the four N x N matrices (a,) CTT 
(i = 1 , 2, 3) and are expressed in the usual abbreviated form for matrices by 
&i ( i = 1,2,3) and ft respectively. To continue, we demand the following natural 
properties: 


(a) the correct energy-momentum relation for a relativistic free particle 

E 2 = p 2 c 2 + m^c 4 , (2.6) 

(b) the continuity equation for the density (2.4), and 

(c) the Lorentz covariance (i.e. Lorentz form-invariance) for (2.2) and (2.5), re- 
spectively. 

To fulfill requirement (a), every single component of the spinor tp has to 
satisfy the Klein-Gordon equation, 2 i. e. 


-h 2 


dt 2 


(-ft 2 c 2 V 2 -I- m%c 4 ) 


On the other hand, from (2.2) it follows by iteration that 


(2.7) 


1 See W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Heidel- 
berg 1994), Chaps. 12, 13. 

2 Notice that the analogy to classical electrodynamics, where the six electromagnetic fields 
E Xi E y , E Zi H x , H y , H z satisfy the first-order differential equations (Maxwell equations) 

flj? ctTJ 

(V x H) = — , ( VxE) = , V*£ = 0 , V B = 0 

cot cot 

in a vacuum. Each single component Ej and //, satifies simultaneously the differential 
equation of the second order (wave equation) 


\_tf_ 

c 2 dt 2 


Ei — 0 


and 


V 2 


c 2 dt 2 


Hi = 0 


For further discussion of this analogy see Exercise 2.1. 
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-o d^"lp o j \ — \ OiiOLj + OLjOLi d^\p 

h w = ~ hc £ —2 

*'.7 = 1 


hmoC 3 A . A^ . dip *2 2 4/ 

+ — : — iJpLiP 4 - f3ai)-^j + 13 m 0 c ip . 

f=l 

Comparison with (2.7) shows the following requirements for the matrices a,, $: 


&i &j + &j &i — 26 ij t , 

cti (3 + (3&i = 0 , 

a 2 = /3 2 =11 . (2.8) 


These anticommutation relations define an algebra for the ip matrices. In order to 
establish hermiticity of the Hamiltonian Hf in (2.2), the matrices j3 also have 
to be Hermitian; thus, 


&J=6« , 


(2.9) 


Therefore, the eigenvalues of the matrices are real. Since, according to (2.8), one 
has of = 1 and /3 2 = 1, it follows that the eigenvalues can only have the values 
±1. Because the eigenvalues are independent of the special representation, 3 this can 
best be shown in the diagonal representation of the single matrices. For example, 
6ij in its eigenrepresentation has the form 



(M 

0 

0 

... 0 \ 


0 

A'l 

0 

0 

= 

0 

0 


0 


0 

0 

0 

An / 


with the eigenvalues A\ , . . . , An, and (2.8) now yields 


/I 

0 

0 



o' 

<N I— 

\ 

0 

1 

0 

• 


o a 1 2 


0 

0 

1 

1 

— 


* . 

\.. 



■■■) 


1 .'. .. 

■■■+ ' 


from which 

A 2 k = 1 , i.e. A k = ±1 


3 This follows, because Aip a = o:i6« implies that 


( 2 . 10 ) 


UAU~ l Ulpa = aUlpa , 


and, therefore, 

A'(u-4>P = oc(u^P . 

The solutions of the rotated matrix A' = UAU~ X are just the rotated vectors ip' a = 
Uipa with the same eigenvalues a. 
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Furthermore, from the anticommutation relations (2.8) it follows that the trace (i. e. 
the sum of the diagonal elements of the matrix) of each d, and of ft has to be zero. 
Namely, according to (2.8) one has 

d; = —ft&ift . 


Because of the identity 
tr AB = tr BA , 
one concludes that 

trd, = tr ft 2 a t — tr ft a, ft = — trd, =4> trd/ — 0 


( 2 . 11 ) 


The trace of a matrix is always equal to the sum of its eigenvalues, which can be 
seen if U transforms the matrix d, into its diagonal form, 

( Ax 0 \ 

0-42 „ , , 

= t/d,-I / -1 . 


V.. 


...An / 


Then 


tr 


(M 0 
0 A 2 

v. : . .. 


a n / 


N 
k = 1 


tr t/d,t/ _1 =tr d/f/t/ -1 = trd, 


which proves the above statement. Because the eigenvalues of d, and ft are equal to 
±1, each matrix a, and ft has to possess as many positive as negative eigenvalues, 
and therefore has to be of even dimension. The smallest even dimension, N = 2, 
cannot be right, because only three anticommuting matrices exist, namely the three 
Pauli matrices 4 <j, . Therefore, the smallest dimension for which the requirements 
(2.8) can be fulfilled is N = 4. We now study this case in more detail and indicate 
immediately one possible explicit representation of the Dirac matrices, i. e. 


OCi = 


0 a i 

G[ 0 


p = 


( 2 . 12 ) 


where d, are Pauli’s 2x2 matrices and 11 is the 2 x 2 unit matrix. With the explicit 
form of the Pauli matrices of (1.65), we have, in detail, 



(° 

0 

0 

In 

, 

/° 


0 

0 


0 

0 

1 

0 

| 

/ 0 


0 

i 

a x = 

0 

1 

0 

0 


’ a2 ~ 0 

- 

-i 

0 


\1 

0 

0 

Oy 

' 

V/ 



0 

0 


(° 

0 


1 




1 

0 



0 

0 


0 

-1 



0 

1 


a 3 = 

1 

0 


0 

0 

, 0= j 


0 

0 



Vo 

-1 


0 

0/ 



0 

0 



0 

0 

-1 

0 


(2.13) 


4 See W. Greiner: Quantum Mechanics -An Introduction, 3rd ed. (Springer, Berlin, Heidel- 
berg 1994), Chaps. 12, 13 and especially Exercise 13.1. 
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Indeed, we can easily check the validity of the relations (2.8). For example, 


QiiQij -b ajai 


0 bi 0 aj \ f 0 bj \ f 0 bi 

b; 0 J \bj 0 ) \aj 0 ) \bi 0 

-M+fT -°) 

0 CTjCTj J \ 0 <7/ <7/ J 

bibj +bjbj 0 \ / 28 1 j H 0 \ 

0 <JiGj+Gj<jj) \ 0 26 ,j n J 


2 6a 


n o 

o J 


holds. Here we have used the relation for the Pauli matrices 5 


GjGj + GjGj = 26 ij 11 . (2-14) 

We also notice that (2.12) describes just one possible choice of the Dirac matrices 
6ii, (3. Each set a' = 0 a, U & = 0 3U~ l , which is obtained from the original 
a, , /3 of (2. 1 3) by a unitary transformation U , can be used equally as well as the one 
introduced here [see (2.21)]. In Example 3.1 it will be shown that all representations 
of the Dirac algebra are unitarily equivalent to each other. Therefore, physical 
results do not depend on the special choice of the Dirac matrices a, and 3, but the 
calculations can become particularly simple in a certain representation. 

Next we want to construct the four-current density and the equation of con- 
tinuity. For that we multiply (2.2) from the left by v T = ( d* . vT ■ */’( • V't ) and 
obtain 

ihip j -^-ip = Y + m 0 c 2 ipi(3ip . (2.15a) 

*=i 

Furthermore, we form the Hermitian conjugate of (2.2), i. e. 


-ih 


d ip^ 

~dT 


= -y +m 0 c 2 ^ft 

k = 1 


and multiply this equation from the right by ip, taking into consideration the her- 
miticity of the Dirac matrices (dj = a, , /3 + = 6), to give 


-ih^-ip = -y ^*^& k ip + m 0 c 2 ip^(3ip . (2.15b) 

k = 1 

Then, subtraction of (2.15b) from (2.15a) yields 

ih^(ip^ip) = yY1 ( 2i6 ) 

k = 1 


or 


5 This relation is covered in detail in W. Greiner: Quantum Mechanics - An Introduction , 
3rd ed. (Springer, Berlin, Heidelberg 1994). 
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dg 

dt 


+ div j = 0 


where 


(2.17) 


Q = A (2. 18a) 

(=i 

is the positive definite density (2.4) and 

j k =cip^a k ip or j=c^aip (2.18b) 

is the current density. Here we have symbolically introduced the three- vector 
a = {a 1 , a 2 , a 3 } = {-a, , -a 2 , - 03 } (2. 19) 


and introduced the upper and lower indices according to our former convention 
[see (1.5) and (1.6)]. From (2.17) the conservation law follows immediately in the 
usual way 


d_ 

dt 



( 2 . 20 ) 


where V denotes a certain volume and F its surface. Since g is positive definite 
and because of the conservation law (2.17) we can accept the interpretation of g as 
a probability density [in contrast to the density g obtained for the Klein-Gordon 
equation, see (1.29) which was not positive definite]. Accordingly, we call j the 
probability current density. Here we have presumed that j is a vector, i. e. that 
its components (2.18b) transform under spatial rotations as the components of a 
three- vector. This still has to be shown. Furthermore, {cg,j} should form a four- 
vector. Hence, it should transform from one inertial system into another one by 
a Lorentz transformation. This point and, in addition, the form invariance of the 
Dirac equation (2.2) with respect to Lorentz transformations (we also call the form 
invariance covariance ) have still to be shown, before we can regard the Dirac 
equation as an acceptable relativistic wave equation. 

We also notice that we have achieved a special representation with (2.12). The 
choice of the matrices (2.12) is not unequivocal. One recognizes immediately that 
each unitary transformation S yields the matrices 

a'^S&iS-' , p ! = SpS-' (2.21) 


which also satisfy the algebra (2.8). We check this for the first commutator (2.8), 
as an example: 

S&iS-'S&jS-' +Sa ] S- l Sa i S~ l = 2 

+ aij a • — 26jj 1 


(q.e.d.) 
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EXERCISE 


2.1 Representation of the Maxwell Equations in the Form 
of the Dirac Equation 


Problem. Write the Maxwell equations 


, „ 1 dH n 

curl E H — = 0 

c dt 


, „ 1 9E 4?r . 

curl H — = — j 

c ot c 


in the form analogous to the Dirac equation (spinor equation): 

1 a / 9 i 47r 

'*/’ = # • 

1 t—* ox> c 


(a) 


(b) 


j= o 

Determine the matrices & and their commutation relations and deduce the wave 
equation for ip from (b). 


Solution. We define the four-component column vectors 


ip = 


/M 

fpi 

1p2 

\-lpiJ 


and $ = 


/<Po 

4>\ 

(p 2 


(1) 


where <p 0 = cq, 4>\ = j\ = jx , 4>2 = jz =jy, <h =h = h- Furthermore, we have 
x° = xo = ct, x 1 = —x\ = x, x 2 = —X 2 = y,x i = — x-} = z- Now we define the 
components of ^ as 


ipo = o , xpi = Hi - i£] , ip2=H 2 - i E 2 , V>3 = - i£ 3 . (2) 


From this definition it follows that the matrices a' have partly real and partly pure 
imaginary matrix elements. From (b) we get the equation 


1 o 1 9 . ^ . 
-t ct - — ip + a 
l \ c ot 


it/ , (/ , (_) 

"- 1 tp + a z —'ip + a i —'ip 


d_ 

dx 


d 

dy 


dz 


47T ^ 
<p 

c 


( 3 ) 


Denoting now the matrix elements of the matrices & by a! jk we can write down the 
components of (3) explicitly. In the same manner we write the components of the 
Maxwell equations (a) and compare the coefficients of both systems of equations. 
In order to obtain the correct signs in the Maxwell equations using (2) we infer, 
as there appears a factor — 1/i, that the cf ik only take values ±1 or ±i. 

Important Remark. As tpo = 0, this procedure determines the columns 1,2,3, but 
not column 0. We can fix the remaining column by requiring that cJ is Hermitian 
and that (a/) 2 = B. For the matrices & we now find 
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Exercise 2.1. 


a = 


a 2 = 


/s 3 

a 



/I 0 

0 

°\ 

H - 


0 1 

0 

0 

11 - 


0 0 

1 

0 



\0 0 

0 

1/ 


0 

-1 

0 

°\ 

- 

-l 

0 

0 

0 


0 

0 

0 

— i 


0 

0 

i 

0/ 


0 

0 

-1 

°\ 

0 

0 

0 

i 

- 

-1 

0 

0 

0 


0 

— i 

0 

0/ 


0 

0 

0 - 

-1\ 


0 

0 - 

-i 

0 1 


0 

i 

0 

0 

V 

-1 

0 

0 

0/ 


( 4 ) 


and the operators & are Hermitian. We see immediately that trace a’ = 0 holds 
for j = 1,2, 3 and, thus, we obtain the commutation relations 

a .1 a .2 , a . 2-,1 _ r > a . Ia .2 _ :-3 


^2^3 I ~ 3 ~ 2 

crcr + a a 

~ 3 ~ 1 i a 1 a3 

a a + a a 


= 0 , 

~ 1 w. 2 

a a — xa 

= 0 , 

*2 *3 

a a = la 

= 0 , 

^3-1 • 

a a = la 


and 


(a 1 ) 2 


(a 2 ) 2 = (a j ) z = 11 


,3\2 


(5) 

( 6 ) 


Using these, we infer from (b) that 

3 o \ / -i 3 


(-Ie^)He 

x ;=0 7 x £=0 

“ E (Q!,) 0*1-2 


= ^V 2 - 


) 2 — 

d(ct) 2 

d 2 1 
c 2 dt 2 J 


0 

dx k 

d 2 




ip 


ip = Oip 


(7) 


as the mixing terms are proportional to the anticommutators (5) and hence vanish. 
This means, from (3) and (7), 


l -4—' dxJ 
j = o 


( 8 ) 


Hence, if there are no source terms (<? = 0) the components of ip, that is the 
components of the electromagnetic field, obey a wave equation. If there are sources 
present, then for the upper component of (8) with ipo = 0 


v-^ d<pj dg . 

^ dxJ dt J 

7=0 


( 9 ) 
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follows as a necessary condition of a solution of (b). Obviously, (9) is just the 
continuity equation. We also can recognize the analogue to the Dirac equation in 
the Schrodinger form: 

( - | ^ - ^o) ^ = -4n<p , (10) 

where H 0 = (1/i) YjI=\ & k d/dx k has the same form as the Dirac Hamiltonian for 
vanishing mass wq. 


2.1 Free Motion of a Dirac Particle 


We examine the solution of the free Dirac equation (2.2) (that is, the Dirac equation 
without potentials) and again write it in the form 


i = Hfi) = (ca • p 4- moc 2 {3 )V> . 

at 

(2.22) 

Its stationary states are found with the ansatz 


*p(x,t) = \Jj(x)exp[—(i/h)et] , 

(2.23) 

which transforms (2.2) into 


e'ip(x) = Hfipix) . 

(2.24) 


Again the quantity e describes the time evolution of the stationary state ip(x). 
For many applications it is useful to split up the four-component spinor into two 
two-component spinors 4> and i- e - 


= 


/M 

Ipl 

V "04 / 


and x=ITI . (2.25b) 

Using the explicit form (2.12) for the a and $ matrices (2.24) can be written as 


with 




(2.25a) 


= c 


0 a 
a 0 


P 


■ 2 2 
+ m 0 c 


0 


or 


£ip = ca • px + m 0 c 2 <p , 


ex = ca -pip - m 0 c 2 x 


(2.26) 


Exercise 2.1. 
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States with definite momentum p are 


V 

X 


\ !° ) exp[(i/n)p*aj] 


(2.27) 


The equations (2.26) are transformed into the same equations for </> 0 and xo> but 
replacing the operators p by the eigenvalues p. Ordering with respect to <f>o and 
Xo results in the system of equations 

(e - m 0 c 2 ) l(p 0 - ca • pxo = 0 , 


-ca • ptpo + (e + m 0 c 2 ) Ixo = 0 . (2.28) 

This linear homogenous system of equations for <p 0 and xo has nontrivial solutions 
only in the case of a vanishing determinant of the coefficients, that is 


(e — moo 2 )! —ca • p 
—ca • p (e + wioc 2 )! 


(2.29) 


Using the relation 6 


(a • A)(a • jB) = A • Bt -f i a • (A x B) 


(2.30) 


equation (2.29) transforms into 

(e 2 - mlc A ) 11 - c 2 (a • p) (a • p) = 0 , 

e 2 = m.QC 4 + c 2 p 2 , 

from which follows 

e = ±E p , E p = +cyjp 2 + m%c 2 . (2.31) 

The two signs of the time-evolution factor e correspond to two types of solutions 
of the Dirac equation. We call them positive and negative solutions, respectively. 
From (2.28), for fixed e. 


c(a • p) 

Xo = f— Vo ■ 

moc 1 + e 

Let us denote the two-spinor (po in the form 


(2.32) 


ip 0 = U = 


U, 

U 2 


(2.33) 


with the normalization 


U*U = U?Ux + UZU 2 = 1 , 

where U u U 2 are complex. Using (2.27) and (2.23) we obtain the complete set of 
positive and negative free solutions of the Dirac equation as 

6 Encountered previously in W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. 
(Springer, Berlin, Heidelberg 1994), Exercise 13.2. 
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aw*,,) = N ( c(»%) ) 

\ m 0 c 2 + A E p J V2nh 


(2.34) 


Here A = ± 1 characterizes the positive and negative solutions with the time evolu- 
tion factor e = A E p . The normalization factor N is determined from the condition 


J $l x (x,t)$ p >\'(x,t)d i x = 6\y6(p - p') 


(2.35) 


Hence, 


N 2 (u<U + U< cH& : P)(& - p ]u) = 
V (moc 2 + \E p f ) 


or, using (2.30) 

W 2 ( 1 + 


<r 2 p 2 


(m 0 c 2 + A E„) 2 




(mpc 2 + A E p ) 2 
(mpc 2 + A E p ) 2 + c 2 p 2 

0 me 2 + A E p ) 2 ~ 

(m^c 4 -I- c 2 p 2 ) + 2m 0 c 2 XE p + E 2 


I ( m 0 c 2 + \E p ) 2 
2(m 0 c 2 + XE P )XE P 


l(m 0 c 2 -I- A£),) 

2A£^ 


(2.36) 


The spectrum of e p \ = XE P , corresponding to the spinors 'Epxix, r), is shown in 
Fig. 2. 1 . There appears - as in the case of the Klein-Gordon equation - a domain 
of positive and negative frequencies (“energy eigenvalues”). We will discuss the 
interpretation of the states with A = — 1 in detail later on. We now recognize that 
all states (2.34) are eigenfunctions of momentum 


P&p\ = P^ P x(x,t) . (2.37) 

For every momentum p there are two different kinds of solutions, those with 
A = +1(£ = -I -E p ) and those with A = — \(e = — E p ). We will now show that 
another quantum number, the helicity, can be used to classify the free one-particle 
states (2.34). First we note that the operator 

*-P=(t l)'P C38) 

commutes with the free Dirac-Hamiltonian operator Hf [cf. (2.2)]. Here 


Ek 


+ mpc 2 
|- mpc 2 


S = 



a o\ 
o a ) 


(2.39) 


Fig. 2.1. Spectrum of the 
eigenvalues of the free Dirac 
equation 
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is the four-dimensional generalization of the spin vector operator. 1 We calculate 

[H { , £ • p] _ = [ca-p + /3m 0 c 2 , £ • p] _ = c [a • p, £ • p]_ , 

as P is a diagonal matrix and hence [/3, 17] _ =0. Furthermore, we obtain 

(« • p)(£ -p) — {£ • p)(a • p) 

_ ( 0 &-p\ (&-p 0 \ 

- \&-p 0 j \ 0 &-p) 

(&-p 0 \ ( 0 &-p\ 

V o & ’ p ) \&-p o J 

0 (& • p) 2 \ _ / 0 (& ■ pf \ _ 

(<r-p) 2 0 J l(o--p) 2 0 


Hence, 

[H{, 17 • p] _ = 0 (2.40a) 

and naturally 

[p,£-p]_= 0 . (2.40b) 


P 



(a) 


P 


(b) 


This means that 27 • p, H f and p can be diagonalized together. The same holds 
for the helicity operator 


As 


p_ 

2 IpI 



( 2 . 41 ) 


as we can immediately see by repeating the calculations that led us to (2.40a, 
b). Helicity has an obvious interpretation: it is the projection of the spin onto the 
direction of momentum, as illustrated in Fig. 2.2. 

If the electron wave propagates into the direction of the z axis, we have 


Fig. 2.2. Electrons with posi- 
tive (a) and negative (b) 
helicity. The double arrow 
denotes spin. O and O sym- 
bolize the two possible rota- 
tions of the electron 


p = (0,0, p} 

and, because of (2.41), 



0 

Vo 


0 0 

-1 0 

0 1 

0 0 


0 

0 

0 

-1 


(2.42) 


with the eigenvalues ±h/2. Clearly, the eigenvectors of yi? are 


Mi \ ( H-i \ f 0 

0 ) ’ l 0 ) ’ U, 


u i 


1 

0 


and «_ i 


0 

1 


0 

U— J 


with 


(2.43) 


7 See W. Greiner: Quantum Mechanics - An Introduction, 3rd ed. (Springer, Berlin, Heidel- 
berg 1994), Chaps. 12, 13. 
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Now we can classify completely the free Dirac waves propagating in the z direction; 
we denote them by \, Sz (x,t) and write explicitly 


% 


p,A,+l/2 


= N 


1 
0 
ca z p 


\ m 0 c 2 + \E p \0) / 


exp[i(pz - XE p t)/h] 


( 


^,A, — 1/2 = N 


\ 


ca z p 


exp[i(pz - A E p t)/h] 


(2.44a) 


(2.44b) 


\ m 0 c 2 + A E p V 1 ) J 

From (2.35) one immediately recognizes the validity of the orthonormality relations 


/ 


^p z X — ^AA f ^s z s'S(p z p z ) • 


(2.45) 


2.2 Single-Particle Interpretation 
of the Plane (Free) Dirac Waves 

In this section we shall examine the single-particle interpretation of the free Dirac 
equation and its solutions in greater detail. Many of our considerations will be 
similar to those carried out when discussing the Klein-Gordon equation and, in- 
deed, we can already remark at this stage that we will find that a single-particle 
interpretation can be performed to a large extent after a suitable modification of 
the operators. However, such an interpretation will not stand a rigorous treatment. 

First we shall discuss the interpretation of the energy. From (2.31) we know 
that the eigenvalues of the free Dirac Hamiltonian Hf (2.2) are e = ±E p . As in 
the previous discussion of the Klein-Gordon equation we have to find out whether 
the time evolution factors can be interpreted as energies. In order to answer this 
question we use the canonical (Lagrange) formalism. In the following Exercise 2.2 
it is shown that the Lagrange density, leading to the free Dirac equation (2.2) is 
given by 

o 

C = — & + ihc^V ■ && — nioc 2 #^ j3'& , (2.46) 

and, using this result, we will also calculate, as previously in (1.59-61), the energy 
related to the free solutions (2.44) of the Dirac equation. The result is that 

E = e = ±E p (2.47) 

is identical with the energy of the states. This is different from the case of the 
Klein-Gordon equation: The free Dirac equation possesses positive and negative 
energy solutions, whose energies are given by (2.47). What is their interpretation 
since there is none in the framework of a one-particle theory? With the proposal of 
the hole theory, Dirac showed the following way out of this difficulty: let us assume 
that real electrons are described only by positive energy states (2.47), these are the 
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E k 


2 

+ 777,() C * 


- m 0 c 2 . 


Fig. 2.3. The negative energy 
states with E < moc 2 are 
occupied by electrons and 
form the “Dirac sea”. As we 
shall see later on in greater 
detail, they represent the 
vacuum state and are unob- 
servable; whereas real, i.e. 
observable, electrons in gen- 
eral exist only in states of 
possitive energy 


states with E = +E P . All states of negative energy are occupied by electrons, one 
electron in each state of negative energy and given spin projection [see (2.44)]. 
This is illustrated in Fig. 2.3. 

In this way a real electron of positive energy is also prevented from falling into 
energetically lower and lower states by radiation emission. A radiation catastrophe 
of this kind is averted by the effective Pauli principle which simply does not allow 
these transitions. On the other hand the question arises as to the meaning of a 
hole in this occupied “sea of negative states”. Later on in Chap. 12, we will see 
that this leads to a meaningful description of the positron (the antiparticle of the 
electron). One thing should already have become clear here: The interpretation of 
the negative energy states of the Dirac equation takes us out of the one-particle 
picture and into the many-particle picture (more precisely, infinitely many particles 
are necessary for the formation of the Dirac sea). Therefore a consistent single- 
particle interpretation of the Dirac theory is not possible, a fact which is contrary 
to Dirac’s initial intentions. Figure 2.3 gives an especially clear view of this aspect. 

However, we pursue the single-particle aspect further and devote ourselves to 
the investigation of modified one-particle operators and to Ehrenfest’s theorems. 
This way, perhaps, a consistent single-particle description can be obtained. For this 
it is helpful to introduce the sign operator A , 


Hf ca • p + (3niQC 2 
c^Jp 2 + m 2 c 2 


(2.48) 


Of course it commutes with the free Dirac Hamiltonian H { . Furthermore, A is 
Hermitian and unitary, i. e. 


A = A' = A~' . 

In momentum representation A has an especially simple form, that is 

^ _ c(a • p) + j3m 0 c 2 
E p 

The name “sign operator ” comes from the fact that 


(2.49) 


(2.50) 




A E n 


*&p,\,Sz — )&p,\,s z 


(2.51) 


A has as eigenvalue the sign A(= ± 1 ) of the time-evolution factor. A = +1 means 
positive-energy states, A = — 1 negative-energy states. An arbitrary state with fixed 
A can be written in the form 


#A 


f 

h 


(2.52) 


Then 





Hi 

(p)Tr!Pp,A A 

E p 


d 3 j 


A^a 


(2.53) 
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We can use A in order to introduce the projection operators A± by 

A± = 1(1 ± A) (2.54) 

with the useful properties 

A + $ x=+l = «£ a=+1 , 

A + $x=-\ = o , 

A-\p\=+\ — o , 

A-$x=-\ =*\=-\ ■ (2-55) 

The operators A± split off the positive (or negative) parts of the state to which they 
are applied. As earlier, in the context of the Klein-Gordon equation [cf. (1.160, 
161)], we call operators “even” or “odd" if they transform positive (negative) 
functions into positive (negative) or negative (positive) functions, respectively. 
The product of two even or two odd operators is always an even operator, and 
the product of an even and an odd operator is always an odd operator. Since all 
positive functions (A = ±1) are orthogonal with respect to all negative (A = — 1) 
functions, the expectation value of an odd operator with states of fixed A is always 
zero; hence, 

(•^oddl^A) =0 . 

A consistent one-particle theory can only use states with a specified sign (either 
A = +lorA = — 1), because the energy can only be defined meaningfully in 
that way (cf. our preceding discussion). However, from that it follows that in a 
consistent one-particle theory all physical quantities must necessarily be defined by 
even operators. In the following we shall see that for Dirac’s theory, Ehrenfest’s 
theorems follow under this condition too, i.e. the quantum-mechanical operator 
equations and the corresponding classical equations become identical. Once again 
this means that the mean values comply with the classical equations, a fact which is 
quite significant. We formalize these considerations by splitting up every operator 


A into an even [A] and an odd {A} part 

A = [A] + {A} . (2.56) 

If we simply write in short form \P± for l P\=±\ we obtain 

A&+ - [A]$+ + {A}$+ , (2.57a) 

A^_ = [AW- + {A}^_ , (2.57b) 

AAA&+ = AA#+ = [AW+ - {A}#+ , (2.57c) 

AAA^_ = -AA^_ = [AW- - {AW- . (2.57d) 

Hence, it follows by, say, addition and subtraction of (2.57a) and (2.57c) or, also, 
(2.57b) and (2.57d) that 

[A] = i(A + AAA) , {A} = \(A - AAA) . (2.58) 


We immediately recognize that the free Dirac Hamiltonian Hf is an even operator 
since 

AHfA = Hf and therefore [Hf] = \{Hf + AHfA) = Hf . 
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EXERCISE 

2.2 Lagrange Density and Energy-Momentum Tensor 
of the Free Dirac Equation 

Problem. Determine the Lagrangian density of the free Dirac field. Calculate the 
energy-momentum tensor and interpret the individual results. 


Solution. We claim that the free Dirac Lagrange density has the form 

C = ip(cih^d^ - m 0 c 2 )ip . (1) 

ip ;= 0t-yO is called the spinor adjoint to ip and the abbreviation 7^ stands for 
7 0 = ft, 7' = fta' [cf. Chap. 3 (3.8)]. Also d^ := d/dx^ is a shorthand notation. 
Straight away we introduce these 7^ matrices instead of the a' and ft, but will 
revert to the d‘ , ft representation in all important sections. The 7^ are appropriate 
for the covariant formulation of the Dirac equation. The Lagrangian density (1) 
can therefore be rewritten as 

C — ipi'ihd,ip -I- cipUhy 0 ^' djip - moc 2 ip^Qip , (2) 

since 70 = 7°. Because of <9, = djdx 1 — (V),, this yields 

C = (ihd, + i hca • V — moc 2 7o) ip 
— (ihd, - ca-p - ftm-QC 2 ) ip , (3) 

which uses the a', ft matrices instead of the 7^. We realize that this is the correct 
Lagrangian density if we determine the equations of motion by variation. Variation 
with respect to ip yields 

6 f Cd 4 x dC n dC 

— — — 0 =7 — = — d u =- = 0 

Sip dip d(d^ip) 

=7 (cihy^dp - m 0 c 2 ) ip = 0 . (4) 

This is Dirac’s equation 

i hd,ip — (a -p 4- ftmoc 2 )ip 

= H f ip (5) 


with the free Hamiltonian 

Hf = ( ca • p -1- ftm 0 c 2 ) . (6) 

We recognize that for the solutions of the equations of motion (5) 

6 C(ip) = 0 . (7) 


Variation with respect to ip yields 
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8fCd 4 x 

Sip 


dC d£ 

dip ** did^ip) 


= 0 


ip ifvyn 


d + mc‘ 


= 0 


( 8 ) 


where d ^ acts to the left on ip. One can easily calculate the canonical energy- 
momentum tensor from the Lagrangian density C: 




dC 


did .ip) 


d u ip + 


dC 

d(df,ip) 


d„ip - 6££ 


which follows explicitly with (1) as 


( 9 ) 


— ipihcj ,1 'Y l/ ip — S^ipihcy 17 d a ip -I- 8^moc 2 ipip 


( 10 ) 


From that one obtains the energy density 7%, 

7° o = — ipHha • Vci/j + moc 2 ipip 

= i p^ (a. • pc + /3moc 2 ) ip = ip^Hfip 


( 11 ) 


Consequently 

J T° 0 d 3 x = {ip\H { \ip) (12) 

(that is, the energy), is just the expectation value of Hf in the state ip. By analogy 
the momentum density 7°, is given by 

7°i = ipihcy°diip = ip\p)iCip , (13) 

in other words, p, = (1/c) f T°i d 3 x = {ip\(p),\ip) is the expectation value of the 
momentum operator in the state ip. The components 

T‘j = ipihc {j‘dj - 8-^dfj) ip + 6‘moc 2 ipip , (14) 

which can simply be written for each solution of the equation of motion as 

T j = ipihc'y'djip — —ipypjCip = —ip^a'pjCip , (15) 

are called the components of the stress-strain tensor. The trace of is given by 
7 = 7 % = ipihc ( 7 ^dfj, - 47 a d a ) ip + 4 m 0 c 2 ipip 

= — 3-0 (ihcy^d^ — moc 2 ) ip + moc 2 ipip (16) 

which, for every solution of the equation of motion, just becomes 

7 = nioc 2 ipip . (17) 


One should notice that this is not proportional to the charge density 

g = eip^Qip = eip^ip . 


Exercise 2.2. 


( 18 ) 
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EXERCISE 


2.3 Calculation of the Energies of the Solutions to the Free Dirac Equation 
in the Canonical Formalism 


Problem. Calculate in the framework of the canonical formalism the energies of 
the solutions (2.44) of the free Dirac equation. 


Solution. We can determine the energy as the integral 

E = J T° 0 d 3 x . 


( 1 ) 


If we insert (11) of Exercise 2.2 as well as the solutions (2.44a) for r/> into this, 
we obtain 

- N i{ ,A 


we obtain 
E = 


rriQC 2 + A E p 


( 


x (& • pc 4- /3moc 2 ) 


1 

0 

pc 


\ 


moc 2 + XE p 
0 


d 3 x 


Since for (2.44) p = (0, 0 ,p) holds, we get by calculation 

E = N 2 V (l,0, TITF’ 0 ) 

\ m 0 c 2 + \E p J 

0 a 


pc 


0 


+ m 0 c 


11 0 

o -n 


i 

o 

pc 


m 0 c 2 + XEp 
0 


= N 2 V 1,0, 


pc 


{ 


( 


tn^c 2 + XEp 
pc 


,0 


moc 2 + XEp 


= N 2 V 


pc | 0 

V V 0 

2 p 2 c 2 


\ 


J 


( 


+ m 0 c x 


1 

0 

-pc 


\ 


V 


m 0 c 2 + XEp 
0 


\ 


+ moc 2 


2 2 2 
moc L p L c L 


moc 2 + XEp ^ (moc 2 + A Ep) 2 

= N 2 V [moc 2 + XEp) 2 ( 2p 2 c 2 moc 2 + 2p 2 c 2 XE p 

+ w.qC 6 + Im^c^XEp + m^c 6 + moc 2 p 2 c 2 — moc 2 p 2 c 2 ) 
= N 2 V (moc 2 + XEp) 2 (2(A E p ) 2 + 2niQC 6 + 2moc 2 p 2 c 2 ) 
= N 2 V (moc 2 + XE p )~ ] 2X 2 E 2 


( 2 ) 


( 3 ) 
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Here we have used wave functions which are normalized with respect to a finite 
spherical volume V . If we furthermore insert the normalization factor N from 
(2.36) 


N 2 = V 


2A E„ 


m 0 c 2 + A E„ 


-l 


(4) 


we finally obtain 


E = A E„ 


that means 

E = + \Jp 2 c 2 4- twqC 4 

for the upper energy continuum (A = +1) and 
E — — \Jp 2 c 2 + m^c 4 

for the lower energy continuum (A = — 1). Hence, the free Dirac equation leads to 
states with positive and negative energy. 


Exercise 2.3. 


As for the Dirac Hamiltonian Hf, for the momentum operator p also 
ApA = p , so that [p] = p ■ 


Let us also determine the even part of the a operator. We have 

, » c&-p + (3m 0 c 2 ^ c(a • p) + fimoc 2 „ „ 

Aa,A = j — a-, , — = — a,- + 2c/?, 


A 


cyjp 2 + m^c 2 c^Jp 2 + m 2 c 2 


c\Jp 2 + m-qC 2 


and therefore 


[a,] = f (&/ + A&iA) = cpi — 

c yP 2 + m-lc 2 


(2.59) 


In a similar way one calculates 

[/3] = t ^-A = m 0 c 2 — ^ . (2.60) 

p cyp 2 + mlc 2 

Now we determine the velocity operator in the Dirac theory. Since we are con- 
cerned with an equation of Schrodinger type [cf. (2.2)], the theorems for the time 
derivatives of operators which we formulated in (1.157) are also valid here and we 
obtain 

^ = k [*> ^ f l - = k t 4, c(& ■ ^ + /5ra ° c2 ] - 

c 

r A /V -i A A 

= — [x , a -pi- = col = v . 
in 


(2.61) 
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Since the eigenvalues of a have the values ±1 we obtain here the paradoxical result 
that the absolute value of the velocity of a relativistic spin-^ particle always equals 
the velocity of light. 8 Moreover, since the a ; do not commute with each other the 
components of the velocity dxi/dt would not be simultaneously measurable. Of 
course, this is nonsense and certainly would not yield the classical relations for the 
mean values (Ehrenfest’s theorems). According to previous statements, however, 
we find, using (2.59), the true velocity operator , that is the even part of dx/dt , to 
be 


dx 

~dt 


= c[&] 


c 2 pA 


'\[p 2 


+ m^c 2 


(2.62) 



Fig. 2.4. Illustration of the 
Zitterbexvegung in the forced 
one-particle picture. The lo- 
cation x performs a Zitter- 
bewegung around the mean 
(classical) trajectory [x] 


Accordingly, the true velocity equals c 2 p/E p for positive and —c 2 p/E p for negative 
free solutions of the Dirac equation. This exactly corresponds, for positive solutions, 
to the classical picture. For waves with negative energy the result is paradoxical in 
the first instance since their velocity is directed against their momentum. Therefore 
particles with negative energy behave formally as if they would have a negative 
mass. Later on we shall comprehend these facts better in the context of the hole 
theory (Chap. 12). Accordingly, the motion of a single particle can be visualized as 
follows (cf. Fig. 2.4). The particle has to perform - if we adhere to the one particle 
interpretation - a kind of Zitterbewegung (trembling motion) around the classical 
location [x] (classical trajectory). 

Now we investigate the result (2.62) from yet another point of view. If we 
consider wave packets built up from free Dirac waves, we show that only the 
mean of the centre of the wave packet generally follows a classical trajectory. 

In order to understand this we integrate the equations of motion in the Heisen- 
berg representation : 


dx 

~di 


1 

\h 


[x,6 f ] 


= COL 


(2.63) 


dot 1 r -i i r /v /\ -| 2i 2i cv 2i 

dT “ m [“■"']- = S + “"'1 - V ~ V m • P-M) 

Since p and Hf are constant in time (because [p,H{]- — 0 — one can 

easily integrate the former equation and obtain 


a(t) = ^a(O) - exp (-2i H f t/h) + ^ . (2.65) 

If this result is inserted into (2.63), then x(t) can be explicitly calculated with the 
result 


c 2 jj / c 'b \ i he A 

x(t) = x(0) + -jT-t + ia( 0) - -r- J — T- exp (-2\H f t/h) . (2.66) 

This result determines the space operator x(t) in the Heisenberg representation at 
time t. Of course, the operator x(t) also depends on a(0) and ai(0). Making use 
of (2.66), the equation of motion of the centre ( x(t )) of each wave packet can be 


This was first recognized by Gregory Breit in 1928. 



2.2 Single-Particle Interpretation of the Plane (Free) Dirac Waves 


119 


determined. It is interesting to compare this with the classical equation of motion, 
which reads 


CC C |(0 = a?d(0) + 


2 

c p 


(2.67) 


The comparison of (2.67) with (2.66) shows that the free wave packet indeed 
follows the uniform classical motion [first two terms in (2.66)], but that moreover 
a rapidly oscillating motion is superposed, namely 


‘(“(O) - ^ ex P (~2i ftft/ti) 


(2.68) 


The amplitude and frequency of these additional oscillations are of the order 
h/2moC and IhiqC 2 / h, respectively. This oscillating motion is the previously men- 
tioned Zitterbewegung. It vanishes if wave packets with exclusively positive or 
negative energy are considered. This becomes clear if we calculate, with (2.54), 


V Hf) 2 H{ 


(2.69) 


where A± are the projection operators for the states of positive and negative energy, 
respectively. Namely, 


[H { , &]_= 2 cp - 2aH f , 

Hf& + &H f = 2 cp ; 

[yi ± , a] _ = - [(1 ± A), a] _ = ±-[i, a] 


1 

“ ± 2 


H { / 

E ,a 
Pp 


= ±- 


2 

1 1 

'lE n 




cp 

± Y toi T 


[HfJ±}_ =0 
HfA± = ±EpA± 


(2.70a) 

(2.70b) 

(2.70c) 

(2.70d) 

(2.70e) 


With this we can write 

A±(& 


cp \ * exp(— 2 iHft/h) _ - exp(— 2 iHft/h) 
A± t = A- 


H f J ~ 2Hf 2H f 

and, because of the preceding calculation (2.70c), one gets 

X - - 4 , C P -r. - 

yl ± a = aA± ± ~ T ol— . 

tLp 


Thus one obtains 

A = 


„ X , C P ~H f cp 

ocA± ± — =F — 

E„ E„ H f 


A a 


and the identity A±A± = A± for the projection operators is recovered in a trivial 
way, finally resulting in 



120 


2. A Wave Equation for Spin-1 Particles: The Dirac Equation 


A = 





(±E P ) 


cp 

w f 


A± = o 


Hence, the Zitterbewegung is caused by the interference between the positive 
and negative energy compounds of a wave packet. It demonstrates that in a real 
sense a single-particle theory is not possible; it can only be approximately obtained 
if the associated wave packets can be restricted to one energy range. 


2.3 Nonrelativistic Limit of the Dirac Equation 


Before we proceed further with the extension of the Dirac theory, it is important 
to check whether the Dirac equation yields physically reasonable results in the 
nonrelativistic limiting case. First we study the case of an electron at rest ; in this 
case we obtain the Dirac equation by setting p'tp = 0 in (2.2), 




PmoC 2 tp . 

In the particular representation (2.12) with 


P = 


1 0 

o -n 


we are able to write down the four solutions 

/!\ 

exp[— i (m 0 c 2 /h)t] , ip (2> 


,(i) 




V; (3) = 


o 

0 

Vo/ 

/°\ 

0 

1 

Vo/ 


(2.71) 


exp[— i(moc 2 /h)t] 


exp[+i(m 0 c 2 //i)f] , i/> (4) = n exp[+i(m 0 c 2 /ft)t] . (2.72) 


The first two wave functions correspond to positive, the last two to negative en- 
ergy values. The interpretation of the solutions with negative energy still causes 
problems and is postponed [cf. (2.47)]; however, the correct interpretation leads to 
a considerable triumph of relativistic quantum theory by forecasting and describing 
antiparticles. At first therefore we restrict ourselves to solutions of positive energy. 
In order to show that the Dirac equation reproduces the two-component Pauli equa- 
tion 9 in the nonrelativistic limit, we introduce the electromagnetic four-potential 


A" = {Ao(*), A(x)} 


(2.73) 


into the Dirac equation (2.2). We know that the minimal coupling 
p»^p»- i A » = ft* 

C 


9 See W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Heidel- 
berg 1994), Chap. 12. 
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ensures gauge invariance of the theory, where 77^ is the kinetic momentum and 
p M the canonical momentum. So we are inevitably guided to the Dirac equation 
with electromagnetic potentials 


C - ~ A 0 ^ ^ p - + $m 0 c 2 ^j ^ 

or 

i • (p - + eA 0 + /W 2 ) V’ • 

This contains the interaction with the electromagnetic field 

H' = --cat • A + Mo = — -v • A + Mo , 

c c 


where 


dx 

V — — = COL 

dr 


(2.74) 


(2.75) 


is the relativistic velocity operator. The expression (2.75) corresponds to the clas- 
sical interaction of a moving charged point-like particle with the electromagnetic 
field. The velocity operator, however, is the formal operator v from (2.61) which 
contains the Zitterbewegung. 


EXERCISE 

2.4 Time Derivative of the Position and Momentum Operators 
for Dirac Particles in an Electromagnetic Field 

Problem. Calculate the time derivative of the position operator x and of the kinetic 
momentum operator tl — p — (efc)A for Dirac particles in an electromagnetic 
field. Compare the result with the corresponding classical expressions and discuss 
Ehrenfest’s theorems. 


Solution. The Hamiltonian of a Dirac particle in the electromagnetic field is [setting 
Aq(x) equal to 4>(x)] 


H — cot • [p A ) + (3moc 2 + ecf) . 


Furthermore, the equation of motion of an arbitrary operator F is given by 

dr dt + h [H ’ ]_ ’ 

and for the position operator we obtain 
Ax i 


( 1 ) 


( 2 ) 


dr h 


[H,x] 


( 3 ) 
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Exercise 2.4. 


since dx/dt — 0. With (1), the commutator reads 

[H,x]_ = c[a. -p, x]_ - e[a ■ A, &]_ 

+ moc 2 0, x]_ + e[<j), *]_ , 


( 4 ) 


<t> is the Coulomb potential, i.e. [o. x]_ = 0. The position operator x is nothing 
but a simple multiplication operator xxp — X'ip (i. e. it is diagonal with respect to 
the spinor indices) and contains no differentiation. Hence [$,x]_ = 0 = [a,x]_. 
Furthermore we will use the identity [Afi,C]_ = [A, C]-B + A[B, C]_ to show 
that 

[a • p, x \ _ = ^2 ([<% > *k]-Pj e* + &j [pj , x k ]_ e k } 

i,k 

E h ^ h ^ . h 

-rajej = -rex , since [pj,x k U = -6# . (5) 

J 


[& • A , «]_ = 0 since x commutes with a as well as with A: we obtain [H , *]_ = 
( h/i)coL , and hence 


d* 

~dt 


— cot = v ; 


( 6 ) 


hence the velocity operator of a Dirac particle is given by 


v = ca. 


(V) 


Let us now see how this operator acts on a Dirac spinor. Considering the single 
components, this reads as 


flip — C&jlp = ±cip 


since the operator a has the eigenvalues aj = ±1. This result means that a Dirac 
particle always moves with the speed of light and it is clear that this finding has 
no classical analogy [cf. the discussion following (2.61)]. The equation of motion 
for the kinetic momentum II = p — (e jc)A is 


A 

d/7 

dr 


dfl 

dt 


1 A /V 

+ -[//,/ 7 ]_ 




e_dA 
c dt 


( 8 ) 


because the potential A(r , t) can depend on time explicitly. The commutator is 
given by 


[H,nU = [H,p]_--[H,A]_ . ( 9 ) 

c 

First we calculate the single commutators 

=c[a-p,p]_ -e[a-A,p\^ + m 0 c 2 [/9,p]_ +e[<f>,p] , (10) 

and p = -i KV, it follows that i.e. [$,p] = 0 since /3 does not depend on space 
coordinates. Furthermore it holds that 


e[(j),p]- = ie/i[V, </>]_ = ieh,(V0 — <^V) 
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Making use of this relation we obtain 

e[<^,p]_^ = ieh(V(p — 0V)t/j = i ehi^cj)^ — (fr'V'ip) 

— i eh{'i/j'V(j) + — (^V^} = i eh(V<f))ip 


for any wave function i.e. e[(f),p]- — i eh(V(f)). Further we find 


c[a-p,p]_ = cY2{[ai,pj]-pjej +a i [p i ,p j ]-e j } = 0 , 

hj 

because [/),,/),]_ = 0 and = 0 (since &; does not depend on the space 

coordinates). This leaves 

-e[a-A,p]_ = -e'^2^[a i ,p j ]_A i e j + d; [A / ,p_ / ]_ey| 

hi 

= ~e^2&i[ A i,Pj]- e j ■ ( 11 ) 

hi 


The second commutator in (9) yields 

— - [H , A]_ = — - jc [& -p, A]_ — e [& • A, A]_ 


+ ^oc 2 [/3, A]_ + e[</>, A]_ | 


Since [/3, A]_ = [0, A],- = 0 and [a • A, A]_ = 0 (A commutes with & and also 
with itself), we get 


-e[oc • p, A]_ = — e 


> 4/] _ M + A« [Pi , Aj] _ ej } 


!>7 


= [Pi,Aj]_ej . 


( 12 ) 




In total we have 


dn _ i 

&t h 


i eh(V<j>) - e^2&i {[Ai,pj]- + [pi,Ay]_}e; 




e 9A 
c dt 


= +« ( - - w) +^E“< {- K.ftl- - ■ <13) 




£ 


Additionally, 

[p/>Ay] _lf) — i h (V,Ay — Aj V[) ift 

= -ih(y i (AjiP)-A j V^) 

= -ih{ipVjAj + AjV iip - AjV tip} 

= -ih (V iAj) i) , 

where the gradients act on A only! Hence [/?, , A,] _ — [/),, Ay] _ = ihCVjA, - V,A ; ), 
and thus we find 


Exercise 2.4. 
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Exercise 2.4. 


— el E + - ^ c&i — V r A ; ) ej = e ( E + -ca x curl A j . (14) 

ij ^ 

The final step is verified by performing some simple algebra, leading to 

r'( 8t ! #xj ) • (15) 

In the classical case this is just the Lorentz force. Thus, we realize that the operator 
x does not satisfy classical equations of motion whereas a “classical” equation of 
motion can be set up for the operator 17, in which the formal velocity operator (7) 
appears. Hence (15) seems to coincide, at least formally, with the corresponding 
classical equation, although we always have to bear in mind that any expectation 
values of (15) are not very useful because v contains the Zitterbewegung. Only 
the projection of the even contributions of (15) produces results which are relevant 
within the scope of a (classical) single-particle description. Obviously at this point 
we reach the limits of the single-particle interpretation. 


The nonrelativistic limiting case of (2.74) can be most efficiently studied in the 
representation 


^ = 


X 


(2.76) 


where the four-component spinor xp is decomposed into two two-component spinors 
(p and X- Then the Dirac equation (2.74) becomes 


i h 


d_ 

di 


<? 

X 


can x 

ca • 77 (p 


~\~ gAq 


<p 

X 


+ niQC 2 


-x 


(2.77) 


the d, as well as the j3 matrices having been inserted according to (2.12). If the 
rest energy moc 2 , as the largest occuring energy, is additionally separated by 


X 


<P 

X 


exp[— i (moC 2 /h)t] 


(2.78) 


then (2.77) takes the form 



X 


C<7 ■ fl X 

ca • 77 


+ cAq 


X 


— 2moc 2 


0 

X 


(2.79) 


Let us consider first the lower (second) of the above equations. For the conditions 
\ihdx/dt\ < \m 0 c 2 x\ and \eA 0 x\ < |mo^ 2 xl 0 e - if the kinetic energy as well 
as the potential energy are small compared to the rest energy) we obtain from the 
lower component of (2.79) 


X = 


an 


V 

2moc 


(2.80) 


This means that x represents the small components of the wave function xp, a 
result we already know from (2.32), while <p represents the large components % ~ 
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(v/2 c)ip. Insertion of (2.80) into the first equation (2.79) results in a nonrelativistic 
wave function for </? 


,S 8T = ^ V + ' 

With the help of (2.30) we continue the calculation, 
(cr • fl)(& ■ fl) = n 2 + i<x . (J7 x fl) 


(2.81) 


= ( p A ) + i & 


= P 


- iKV A ) x [ - ihV 

c 


A) --/hx-(VxA) 
c ) c 


, A e ,\ eh ^ 

= p A ) cr • B 

c ) c 


and finally obtain (2.81) in the form 



2mo 


eh 


2mQC 


<r • B + Mo 




(2.82) 


This is, as it should be, the Pauli equation. 10 The two components of tp, there- 
fore, describe the spin degrees of freedom, which we have already dealt with in the 
section dealing with free Dirac waves [cf. (2.38—44)]. From the former discussion 
on the Pauli equation we know that this equation, and hence also (2.82), yields the 
correct gyromagnetic factor of g = 2 for a free electron. This can be demonstrated 
once again by turning on a weak, homogeneous magnetic field 


B = curl A 


A = \B x x 


where the quadratic terms of A in (2.82) have been neglected. With 


p - -A 
c 


P — x x 
2c 


■ p 2 (B x x) • p 

c 

ri e 

p 2 B • L , 

c 


where L = x x p is the operator of orbital angular momentum, and 


s = \h& 


is the spin operator, it follows for the Pauli equation (2.82) that 






— (L + 2 S)-B + M 0 
2moc 


<P 


(2.83) 


This form shows explicitly the g factor 2. However, the most important result is 
that, in the nonrelativistic limit, the Dirac equation transforms into the Pauli equa- 
tion, i. e. to the proper nonrelativistic wave equation for spin-| particles. Since spin 


10 This is well known from W. Greiner: Quantum Mechanics - An Introduction, 3rd ed. 
(Springer, Berlin, Heidelberg 1994) Chap. 12. 
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exists both at low as well as at high velocities, this implies that the Dirac equa- 
tion describes particles with spin In contrast to the Klein-Gordon equation [cf. 
(1.140), Chap. 1], valid for spin-0 particles, we have now found a relativistic wave 
equation for spin-1 particles. Clearly spin comes into the theory by linearization 
of the second-order differential equation (Klein-Gordon equation). 11 


2.4 Biographical Notes 

DIRAC, Paul Adrien Maurice, British physicist, * 8.8.1902 in Bristol, f 2.10.1984 
also in Bristol. With his fundamental investigations he contributed essentially to the 
formulation of quantum mechanics and quantum electrodynamics. In 1933 Dirac 
was awarded the Nobel prize in physics, together with E. Schrodinger. With many 
original contributions, he initiated modern developments in physics (e. g. magnetic 
monopoles, path integrals). He was one of the really great physicists! 


"cf. in this context the linearization of the Schrodinger equation in W. Greiner: Quantum 
Mechanics - An Introduction, 3rd ed. (Springer, Berlin, Heidelberg 1994) Chap. 13. 



3. Lorentz Covariance of the Dirac Equation 


A proper relativistic theory has to be Lorentz covariant, i.e. its form has to be 
invariant under a transition from one inertial system to another one. To establish 
this we will first restate the essentials of Lorentz transformations and also refer to 
Chap. 14 for supporting group theoretical arguments. 

Two observers, A and B , in different inertial systems describe the same physical 
event with their particular, different space-time coordinates. Let the coordinates of 
the event be for observer A and x' !l for observer B. Both coordinates are 
connected by means of the Lorentz transformation 

3 

(x'Y = '^2,a u ^ = a v ^ = (a"x) v . (3.1) 

n — 0 


a denotes the abbreviated version of the transformation matrix and x the four- 
dimensional world vector. Equation (3.1) is a linear, homogeneous transformation 
and the coefficients a" depend only on the relative velocities and spatial orientations 
of the reference frames. The distance between two space-time points is invariant 
under the Lorentz transformations (3.1), which can be expressed differentially by 
means of the invariance of the line element (see Fig. 3.1) 

ds 2 = dx M dx^ = dx^ dx" . (3.2) 



x> 1 + dx> 1 

dxi'dxp = (dx'Y^dx'),, 


Figure 3.1. 


Hence 


dx^dx M = (dxy(dx% . (3.3) 

This may also be deduced from the empirical fact that the velocity of light is the 
same in every inertial system. 1 Now, from (3.3) and (3.1) it follows that 

(dx , y i (dx , ) At = dx" dx^ = dx" dx^ = 6% dx" dx a ; 

hence 


a\a^ = 61 . (3.4) 

These are the orthogonality relations for Lorentz transformations. We now distin- 
guish between proper and improper Lorentz transformations. Namely, from (3.4) 
follows 


See J.D. Jackson: Classical Electrodynamics , 2nd ed. (Wiley, New York 1975). 


i 
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Figure 3.2. 


[det«)] 2 = l , i.e. det«)=±l . (3.5) 

For the proper Lorentz transformations the determinant of the transformation co- 
efficients is 

det«)=+ 1 . (3.6) 

These proper Lorentz transformations can be obtained from the identity (which 
has determinant + 1 too) by an infinite number of successive, infinitesimal Lorentz 
transformations. They consist of the group of all transformations of coordinates 
from one coordinate system into another one which moves with constant veloc- 
ity in an arbitrary direction. Normal three-dimensional rotations and translations 
belong to the proper Lorentz transformation too. The improper Lorentz transfor- 
mations contain a (discrete) reflection either in space or in time. Such discrete 
transformations can not be obtained from the identity by successive infinitesimal 
transformations. The determinant of the transformation coefficients of the improper 
Lorentz transformation is 

det«) = -l . (3.7) 

It will now be our task to find a relation between the measurements of observer 
A and those of observer B which have been performed by both of them in their 
respective inertial systems. More precisely, we have to find a relation between 
(Fig. 3.2) 

(ifr/ 1 iK * ) = 0 and ^7^^; - m o tp'Cx) = 0 . 


- i 

ip(x) 

i 

A 

V 

^ * 

<//(*') 

x° = ct 

system A 


and 

x 10 = ct' 

system B 


For given -0( jc ) for A, the transformation must enable us to calculate tp'( x ) for B. 
The requirement of Lorentz covariance now means that x ) in system A as well 

as x ) in system B have to satisfy the respective Dirac equations, which have 
the same form in both systems. This is precisely the relativity principle: only in this 
way do both inertial systems become completely equivalent and indistinguishable. 

In the following considerations it is much more convenient to denote the Dirac 
equation in four-dimensional notation to show the symmetry between the time- 
coordinate ct and the space coordinates x l . Therefore we start with [from (2.2)] 


d ^ <9 - 2 

ih— +ihc - pmoc 1^ = 0 , 


k = 1 


dx k 


which we multiply by (3/c from the lhs to obtain 
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+ S p &kih 'is - ^ = ° • 

With the definitions 

7 ° = /? , 7 *=$&i , * = 1,2,3 (3.8) 

this can finally be written in the form 

i ' s ( / ^ s+7l ^ r+72 ^ +7> ^)' i,_ " ,0 " / ’ =0 ' (3 ' 9) 

From now on we will write the matrices 7 ^ without the operator sign ~ ; likewise 
we will denote the four-dimensional radius vector (world vector) by x ; hence 

x =x , (3.10) 

which will simplify the notation in the following considerably and will scarcely 
lead to confusion. A more elegant formulation of the anticommutation relations 
(2.8) is possible using the 7 matrices. Because of (3 2 = H it is obvious that they 
now read 


7 a V+7 1 '7 /4 = 20 /u '1L , (3.11) 

“1” being the 4 x 4 unit matrix. The 7 * (i — 1,2,3) are unitary [( 7 *) 1 = 7 ^] 
and anti-Hermitian [( 7 ')^ = — 7 ']; indeed, 

( 7 ') 2 = -n = - 7 ' 7 't , * = 1,2,3 => ( 7 ') 1 = 7 ,+ • 


The fact that they are anti-Hermitian follows directly from (2.8) and (2.9) 

7 '* = = &i$ = — j d&i = — 7 ' . (3.12a) 

On the other hand 7 0 is unitary and Hermitian, 

( 7 0 ) 2 = +n = + 7 ° 7°1 => 7 °t = 7 0 . (3.12b) 


In the so-far-used standard representation the y 1 ' 1 can be written down explicitly 
using (2.13) 


7 = 


0 a' 
-d i 0 


7 


11 0 

0 -n 


(3.13) 


A further short hand notation is often convenient: It is the so-called Feynman- 
dagger notation, i.e. for example 

3 

4 = 7 ^ = g^A v = 7 °A° - £ y'A' = y°A° - 7 • A . (3. 14) 

1=1 

Another example is the nabla dagger : 
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u d o d , <9 7° <9 T-r 

^ 1 f)rt 1 f)r* - ' 

i = 1 


(3.15) 


dx^ dct dx‘ c dt 

i=i 

With this the Dirac equation (3.9) can be written in the very concise form 

(i/ty— moc)ip — 0 , (3.16) 

or, with = ihd/dx^, 

(fi-m 0 c)‘4> = 0 . (3.17) 


Introducing the electromagnetic potentials using minimal (gauge-invariant) cou- 
pling yields 

(p ~ ^4 ~ m oc^ ip = 0 . (3.18) 

Both and are four- vectors; hence the difference p^ 1 — ( ejc)A ^ is a four- 
vector too. While discussing the covariance in the following part we can thus 
confine ourselves to the free equations (3.16) and (3.17). 


3.1 Formulation of Covariance (Form Invariance) 


Covariance of the Dirac equation means two different things: 

(1) There must be an explicit rule to enable observer B to calculate his ip'(x') if 
tp(x) of observer A is given. Hence y' l {x') of B describes the same physical state 
as ip(x) of A. 

(2) According to the principle of relativity, which states that physics (i.e. the basic 
equations of physics) is the same in every inertial system, ip'(x') must be a solution 
of a Dirac equation which has the form (3.16) 

(i % /M ^7 - m 0 cj V>'(x') = 0 (3.19) 

in the primed system, too. Additionally the 7 '^ have also to satisfy the anti- 
commutation relations (3.11). This is a requirement of the principle of relativity, 
as otherwise A and B could distinguish their inertial systems. Therefore, 

7 oy" + 7 «yM = 1 (3.20) 


and 


7 '°t = y° (3.21a) 

7 ,ft = -7 rt , * = 1,2,3 . (3.21b) 

Within the sense of the principle of relativity these conditions of Hermiticity and 
anti-hermiticity must hold in all inertial systems, something we can also understand 
in the following: Let us first write (3.19) in the form 


ih'y ' 0 


dip'ix') 


-7 ,4 i/i 


d 

dx' k 


dct' 


+ moc ) ip f or 
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ifry /Q d^* ^ = (~n' k + + moc 2 ) ^ or 


Sx'* 


dt' 


- n y k i \h 


d 


dx 


Ik 


+ 7 '°m 0 c" 


<//(*') = //y 


(3.22a) 

(3.22b) 


The last form (3.22b) is Schrodinger type; the former (3.22a) is not Schrodinger 
type, because on the lhs the factor 7' 0 appears with the time derivative. Here we 
have used (7' 0 ) 2 = 1 from (3.20). The Hamiltonian H' from (3.22b), given by 


H' = -C7 /( V* 


ih 


d 

dx' k 


+ 7 ,0 m 0 c 2 


(3.23) 


must be Hermitian in order to have real eigenvalues 
{H') ] =H' . 

This is perhaps a more evident requirement of the principle of relativity because 
it ensures that both observers see real energy eigenvalues. Now the momentum 
operators 


are Hermitian and commute with the 7' matrices. Hence, both 7'° as well as the 
products 7 /0 7 //: from (3.23) must be Hermitian, which means that 

(y°)' = y° , 

(W)* = (V*)' (7*)' = (Y*)' Y° i YV 

, / /t\t /o ik 

=> (7 j = 7 7 7 = -7 

Again we have used the relations y /0 y /k = y lk 7'° and (7 /0 ) 2 = 1 which are 

included in (3.20) and we stress the point that only H' from (3.23) has to be 
Hermitian, but not 

H' = -ci K +m 0 c 2 

from (3.22a), the first part of which is indeed anti-Hermitian. Therefore, we have 

It can be shown by means of a rather long algebraic proof (see Example 3.1 for 
better understanding) that all 4 x 4 matrices 7'^ which satisfy (3.20) and (3.21) are 
identical up to a unitary transformation <7, i.e. 

7^ = , f/ t = U~ x . (3.24) 

Hence it follows, because unitary transformations do not change the physics, that 
without loss of generality we can use the same 7 matrices in the Lorentz system 
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of observer B as in the Lorentz system of physicist A, i.e. the matrices (3. 13). 2 
Therefore, we shall no longer differentiate between 7 '^ and 7 M , instead we rewrite 
(3.19) as 

(&' ~ me) = 0 , (3.25) 

where now 

d 

% = ih ^Q^ (3-26) 

holds. 


exercise 

3.1 Pauli’s Fundamental Theorem: The 16 Complete 4x4 Matrices t A 

Problem. Show that all representations of the Dirac algebra 7 M 7 *'-|- 7 1 ' 7 M _ 2 iff 1 ' 11 
for 4 x 4 matrices which satisfy 7 °* = 7 0 , 7 't = — 7 ' are unitary equivalent! 

Solution. The proof 3 is divided into two parts: 

(i) Proof of the fundamental theorem for Dirac matrices. For two four-dimensional 
representations 7 ^, 7 ^ of the Dirac algebra there exists a nonsingular 4x4 matrix 
S with 7 ^ = SjJ- 1 . 

(ii) If additionally 7o = 7o> 7* = ~~ 7^> 7o = 7o*> 7/ = — 7 ,^, then S can be chosen 
to be unitary. 

The proof uses the 16 following 4x4 matrices (A = 1, . . . , 16) (see also 
Chap. 5): 

A = n , 

70,171,172,173 , 

17273, 17371 , 17172, 7i7o, 727o, 737o , 

7i7273,i7i727o,i737i7o,i72737o , 

171727370 , (1) 


for which 

tl = \ (A = 1, . . . , 16) . (2) 

Let us denote hyi 727370 = 75- It can easily be proven that 75 anticommutes with 
In (ji = 0, . . . , 3), 

757/z + 7 m75 = 0 (ji = 0, . . . , 3) . (3) 

2 This fact can also be considered to be a consequence of the principle of relativity, since 
otherwise the different structure of the 7 matrices in the inertial systems of A and B would 
indicate to A and B in which of the systems they are. 

3 This argument stems from R.H. Good: Rev. Mod. Phys. 27, 187 (1955). 
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Now let us prove the fundamental theorem for Dirac matrices: 

(1) For all Pa but the H there exists a Pb with 

PbPaPb = —Pa ■ (4) 

We will specify for every Pa a corresponding Pb : 


A 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

B 

9 

4 

3 

3 

4 

5 

3 

2 

2 

2 

2 

6 

6 

7 

2 


One verifies easily by explicit calculation that (4) holds. 

(2) The traces of all Pa (A = 2, . . . , 16) are zero. From (2) and (4) 

-tr(A) =tr{f B r A r B ) =tv(rir A ) = +tv(r A ) . 

(3) The Pa are linearly independent. From 

16 


= 0 follows 

A=\ 


a A = 0 (A = 1,..., 16) , 

(6) 

because if we multiply the sum by Pb, 


a B H + oaPaPb = 0 

A+B 

(7) 


and 


4a B + Y, a A tr (P a Pb) =0 . ( 8 ) 

A±B 

Equation (1) implies that PaPb = const Pc, i.e. by means of the algebra a product 
of two matrices of (1) can be expressed in terms of another such matrix. In the case 
where A f B then P c P H; therefore the sum in (8) vanishes and consequently 
as = 0. This conclusion is valid for all B — 1, . . . , 16. The linear independence of 
the matrices P A is most important and will frequently be used in the following 

(4) Each 4x4 matrix can be expanded in terms of the P a , 

16 

X = J2 x aPa - ( 9 ) 

A = 1 

This is evident since the 16 linearly independent Pa generate a 16-dimensional 
space, that is the space of the 4 x 4 matrices. Then 

x B = \tr (Pbx) ■ ( 10 ) 


Exercise 3.1. 


(5) Each 4x4 matrix which commutes with all Pa is a multiple of 11 (Schur's 
Lemma). Consider the matrix 
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Exercise 3. 1. 


X = XbTb + ^ x A P A , ( 11 ) 

A^B 

where we have singled out a particular matrix Pb on which we shall focus our 
attention. We will show that xg — 0. With respect to (4) we first choose Pc such 
that if fulfills 

PcPbPc = —Pb ) ( 12 ) 

and, since x commutes with all P A and therefore also with Pc, 

x = fcxrc 

=> xbPb + ^2 x aPa = x bPcPb Pc + x a PcPaPc , (13) 

A^B A±C 

XbPb + J:X A Pa = -XbPb + Z^XaPa ) (14) 

A^B A^B 

due to the fact that Pc and P A commute or anticommute. Next we multiply with 
Pb, take the trace and obtain 


Xb = - xb • (15) 

Thus, xb = 0 as claimed. 

( 6 ) If ->n and 7 ^ are two representations of the Dirac algebra and P A , P A are the 
corresponding 16-dimensional bases, then 

P A S = §P A , (16) 

where 

S = ^PbFPb (17) 

B = 1 

with an arbitrary 4x4 matrix F. To understand this, consider the matrix 
16 

r A sr A = ^rr A r B Fr B r A . (is) 

B = 1 

According to (1) we have PbP a = £cPc with e c G {± 1 , ±i). For fixed A, if B 
runs from 1 to 16 then C takes all values from 1 to 16. This is so, since otherwise 
we would have 

PbPa = £cPc 

and 

P d P a =6 c P c ( PdPPb ) , (19) 

leading to Pg — ecPcPa = (£c/$c)Pd, which contradicts the linear independence 
of the P A . Since the P' A are constructed from the 7 ', in the same way as the P A 
from the 7 ^, we also have the relation 
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p f p f 
1 B 1 A 


e c r' c 

with the same £c ■ Inverting (20) yields 


r' r' — r' 

1 A 1 B ~ _ 1 C > 

£c 

and substituting (21) into (18) then gives 
16 , 16 


f'sr A = E = E r' c Pr c = s , 

C = 1 £c C = 1 


( 20 ) 


( 21 ) 


( 22 ) 


thus proving the relation (16). 

(7) The matrix F can be chosen such that S does not vanish. If S — 0 did hold for 
all F, then by a special choice of F such that a single element has the value 1, 
all the remaining elements being set to zero (i.e. F va = 1, all other F a p = 0), we 
could infer from (17) that 


16 


E WU = 0 


(23) 


B = 1 


which must hold for all u, a, due to the arbitrary choice of F. From the relations 
(23) for the various combinations v, u we infer the matrix equation 


16 


E( / '»Uft = ° ■ 


(24) 


B = 1 


Exercise 3.1. 


Since — 1, the cannot vanish simultaneously, so that (24) is in contra- 

diction to the linear independence of the P B . 

(8) The matrix F can be chosen such that S is not singular. To prove this lemma, 
we construct 

f = (25) 

b= l 

which arbitrary G, specified below. In analogy to point 6 we can show that 

r A t = tr' , ( 26 ) 

which, together with (16) leads to 

r A fs = tr' A s = tsr A , (27) 

i.e. ( TS ) commutes with all f)\ and is therefore, according to point 5, a multiple 
of the unit matrix 

fS=Kt . (28) 

Here we choose G such that f ^ 0 (cf. point 7). This enables us to choose F 
in (17) such that K p 0. This is so because the assumption K = 0 for all F would, 
with respect to (28) and (17), result in 
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Exercise 3.1. 


16 

y^tr' B Fr B =o . (29) 

B = 1 

With a choice of F in line with point 7, we have, from (29), 

E(^U(A)„=° or < 3 °) 

B = 1 

' (31) 

The do not vanish simultaneously since t p 0 and {/^} contain the unit 

matrix 11. Consequently this yields a contradiction to the linear independence of 
the t B . Thus S is not singular, and 

^ = S^S~ l , (32) 

completing part (i) of the proof. 

(ii) Now we will show that in case of 

7 1 = 9w In , 7? = 9nn 7m • ( 33 ) 

5 can be chosen as a unitary operator. To see this let 

^ = V^V-' , detV = 1 , (34) 

i.e. 

V = (det 5 ) ~ ' 5 . (35) 

Except for an arbitrary factor ±1, ±i, due to det V = det(±V) = det(±iV) = 1, 
the matrix V is completely determined. To assume in contrast to this that 

7; = V l7 ^r' - V 2lil V~ { , (36) 

would lead to 


V2 X V\1»=1,.V- X V X (37) 

and, with respect to point above, V 2 -1 Vi = k • I; hence 

V, =kV 2 , k = ±1, ±i . (38) 


Therefore, taking the Hermitian conjugate of (34), we obtain 

ii = (39) 

and, by means of (33), 

i.e. (Vt) -1 likewise fulfills (35) as does V . From (38), it follows that 


( 40 ) 



3.1 Formulation of Covariance (Form Invariance) 


137 


(V f ) = e im7r/2 V , 

y t _ e -im7r/2y-l ^ 

(41) 

j/ty _ e -im7r/2 


(42) 


Since the diagonal elements of the product of a matrix and its Hermitian conjugated 
counterpart have the form 

(M)u = Y,(*%( 9 h = 'E v fi v fi • (43) 

j j 

and V* Vp is positive definite and real, the factor e _im7r / 2 in (42) must be positive 
definite and real too, implying m = 0. Hence, 

= B , (44) 

which was to be proven. 


We will now explicitly construct the transformation between yXx ) and 4 >, ( x ')- 
This transformation is required to be linear, since both the Dirac equation as well 
as the Lorentz transformation (3.1) are linear in the space-time coordinates. Hence 
it must have the form 

ip , (x') = 'ip , (ax) = S(a)ip(x) = S(a)Tp(a- l x') , (3.27) 

where a denotes the matrix of the Lorentz transformation a" of (3.1) and 5(a) is a 
4x4 matrix which is a function of the parameters of the Lorentz transformation a 
and operates upon the four components of the bispinor (;Xx). Through a it depends 
on the relative velocities and spatial orientations of the observers A and B. The 
principle of relativity, stating the invariance of physical laws for all inertial systems, 
implies the existence of the inverse operator 5 -1 (a) that enables the observer A to 
construct his wave function ip(x) from the ip'(x') of observer B . Therefore, it must 
hold that 


4>(x) = S 1 (a)i//(* / ) = S '( a)f>'(ax ) . 

(3.28a) 

Because of (3.27) we can also write 


II 

II 

I 

(3.28b) 

and, comparing with (3.28a), we find 


S~ l (a) = S (a -1 ) . 

(3.29) 


Our aim is to construct S fulfilling (3.27-29). Starting from the Dirac equation 
(3.16) of the observer A, i.e. 

- m o c^J V’C*) = 0 , 


Exercise 3.1. 


and expressing ip(x) by means of (3.28a) yields 
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^itvy^S \a)J^ - mocS l (a)J ip'(x') = 0 . 

Multiplication with S(a ) from the left and using S (a)S (a) = 11 then gives 
d 


ihS(a)y»S («)^7 “ m o c ) = 0 


(3.30) 


With regard to (3.1) we transform d/dx^ to the coordinates of the system B, 
d dx w d „ d 


dx i 1 dx 1 ' dx ,v 

so that (3.30) becomes 


1 dx 


d 


ii/ 


ih (§ (a)^S \a)a v p) - m 0 c 


V>V) = 0 


(3.31) 


(3.32) 


This has to be identical with the Dirac equation (3.25), since form invariance of 
the equations of motion is required, i.e. S(a ) must have the property 


S(a)YS-\a)a\ = 7 " 
or equivalently 
S(a)7 I/ 5 _1 (a) = 


(3.33) 


(3.34) 


This is the fundamental relation determining the operator S : To find S means solv- 
ing (3.34) which holds for discrete as well as continuous Lorentz transformations 
since the above deduction does not depend on det = ±1. Once we have shown 
that there exists a solution S(a) of (3.34) and have found it, we will have proven 
the covariance of the Dirac equation. We may already now specify more precisely 
the definition of a spinor which, we have previously introduced somewhat inaccu- 
rately as a four-component column vector: In general , a wave function is termed a 
four-component Lorentz spinor if it transforms according to (3.27) by means of the 
fundamental relation (3.34). Such a four-component spinor is also frequently called 
a bispinor , since it consists of two two-component spinors, known to us from the 
Pauli equation. 4 

In determining S (a) we expect to deal with novel features which are not present 
in ordinary tensor calculus, since bilinear forms of x/j, e.g. the current four- vector 
of (2.18) 

{cgj k } with q = ' and j k =^a kf ip ; A: = 1,2,3 , 

have to transform like four-vectors; we will now discuss these properties. As we 
have already learned, 5 in general it is simplest to generate a continuous group 


See W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Hei- 
delberg 1994). 

See W. Greiner, B 
Heidelberg 1994). 


delberg 1994). 

5 See W. Greiner, B. Muller: Quantum Mechanics - Symmetries , 2nd ed. (Springer, Berlin, 
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transformation by constructing the group operators for infinitesimal transforma- 
tions and then composing operators for finite rotations, translations, etc., by con- 
necting the infinitesimal operators in series. Following the same pattern in our case 
of Lorentz transformations, we first construct the operator S(a). for infinitesimal 
proper Lorentz transformations given by 

(3.35) 

with 

Au^ = - Auj * v . (3.36a) 

The antisymmetric form (3.36) follows from (3.4) by neglecting quadratic terms 
[of the order ( Auj) 2 \. 

=61 = (6? + Auj^) {6° + ZW/) 

~ 6?6° + S^Auj^ + 6SAuP v = 61 + Ajj v ° + Auj% . (3.36b) 


Hence, 

Au u a + Auj c u = 0 , or g^ u ( Auj v a + Auj^A = 0 = Auj^° + Auj ail . 

Consequently, there are six independent non- vanishing parameters Auj ^ u . Each 
of these group-parameters (rotation angle in the four-dimensional Minkowski 
space) generates an infinitesimal Lorentz transformation. We will now give two 
examples pointing out the physical significance of the 

(a) Auj 10 = Auj 01 = —A0 ^ 0 , all other Auj* 1 '' = 0 . 

This implies that 

Auj\° = g] a Auj a0 = g\\Auj w = — Auj 10 = +Af5 = +AuP x = Aujq = — AuP\ 

and 

Auj' = g icr Auj cl = 0 for all * = 1,2,3 . 

According to (3.1) we then find 

(*T = + Au> l o6X + Auj\ 8" 0 6\) x» 

= (6Z-Af36X-A0656l)x’ t , 

or, explicitly, 

(jc ; )° = x° — A(3x l — x° - —x } , 

c 

(x 1 ) 1 = -Af3x°+x' = + , 

{x'f = x 1 , 

(x ') 3 = x 3 . 

Therefore the inertial system (') of the observer B moves along the positive x 1 axis 
relative to the system of observer A according to 


(3.37a) 


(3.37b) 
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(*')' = 0 


Av 

c 


= Ap 


This means that case (a) describes an (infinitesimal) Lorentz transformation for a 
motion parallel to the x' axis with a velocity Av = Ape, (see Fig. 3.3). 


*4 


*'4 


Fig. 3.3. The x' system 
moves relative to the x sys- 
tem with velocity Av = cA/3 


Av = cAf3 ^ 






(b) Auj\ = —Auu n — Au 2] = A(p , all other — 0 


According to (3.1) 

(x'r = K + + Wl(~A<p)] x* (3.38a) 



Fig. 3.4. An infinitesimal ro- 
tation of the coordinate sys- 
tem around the z axis 


or explicitly 
(x')°=x° , 

(x') 1 = x 1 + A<px 2 , 

(x') 2 = — Atpx 1 + x 2 , 

(x') 3 = x 3 . (3.38b) 

Clearly this transformation generates an infinitesimal rotation about the z axis by 
an angle A<p (Fig. 3.4). For finite rotations it would read 

(x') 1 = x 1 cos <p + x 2 sin <p 

(x') 2 = — x 1 siny? + x 2 cosy? . (3.38c) 


3.2 Construction of the & Operator 

for Infinitesimal Lorentz Transformations 

We now return to our original aim of determining the operator S(a ) = S (Aj m "), 
by expanding S in powers of Auj^ and keeping only the linear terms of the 
infinitesimal generators Aui tJ,u \ hence we write 

S (AO = 11 - X -o tat Au^ v , 

■§" 1 (AO = i + ^uAu^ , (£„„ = -*„„) . (3.39) 

The factors i /4 in the second term of the rhs are chosen such that the six coefficients 
u can be expressed in a convenient form, as will become clear later. Each of the 
six coefficients a iu/ is a 4x4 matrix, which is indicated by the operator sign " ! Of 
course, the same holds for the operator S and the unit matrix H. By finding the 
we can determine the operator S. The second relation (3.39) results from (3.29), 
which means that the inverse operator is obtained by substituting Aui ” — > —Au". 
Inserting (3.39) and (3.35) into (3.34) which determines 5, we find 
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(<5% + Au/) 7 ^ = - ^dapAuj* 0 ^ 7 M + ^aapAuj a0 ^j 

or, omitting the quadratic terms in Aoo^ u , 

Alj/ r = -'-Au,^ {& a p Y - Y& a0 ) • (3.40) 

Taking into account the antisymmetry (3.36) Au^ v = —Acj uti , the lhs of (3.40) 
becomes 

-\Au a * {a a0 Y - 7 u a a p) = g^Au^r = Au>» a <j'*'f 

= Au0 a (g v a l 0 ) = Au 0a {g v alp) 

= h Au 0a tfalp-g^la) 

= ~\Auf x0 {cj U alp ~ g V 0 la) • 

Hence, we end up with the relation 

-2i (g u alp - g U pla) = [&ap,l l ']_ ■ ( 3 -41) 

The problem of constructing S according to the fundamental relation (3.34) 
is now reduced to that of determining the six matrices a a p. Since a a p has to be 
antisymmetric in both indices, a and (5, it is natural to try an antisymmetric product 
of two matrices: 

&ap = ^ [lonlp\_ • (3-42) 

This form fulfills the requirement (3.41), which will be proven in the following 
exercise by taking into consideration the commutation relations (3.11). 

exercise 

3.2 Proof of the Coefficients & a p for the Infinitesimal Lorentz Transformation 

Problem. Prove that the a a p = i[ 7 a , 7 /j]_ /2 fulfill (3.41), 

[Y, O a p\ _ = + 2 i (g u alp - g v pi a ) • 

Solution. Inserting the expression for & a p in (3.41), we get 

[l V , &ap] _ = ^ [7". llonlp]-] _ = ^ { [7", lalp] _ ~ [l’' : Iplcc) _ } • 

By means of the algebra (3.11) it then follows that 

[ 7 ". Zap] _ = \ {2 [l V ,lalp\ _ - 2 [ 7 '', gap\ _ } = 1 [Yilap] _ ■ 
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Exercise 3.2. 


Furthermore we have 

i [ 7 ". la 7/?] _ = i {l V lal0 - 01 a + laYlp} 

= i {l U lal0 - Ig" 01a + 2g v Oc 'y 0 - l^la^} 

= 2i {g u a 10 ~ g V 0la) , 
which is the required form. 


According to (3,39), the operator S(A u^) for infinitesimal proper Lorentz 
transformations is now 

5 (Aon = I - ^Auj^ = 1 + l - [ 7 „, 7i/] _ . (3.43) 

The next step is to construct S(a ) for finite proper Lorentz transformations by 
successive application of the infinitesimal operators (3.43). To construct the finite 
Lorentz transformation of (3.1) from the infinitesimal one (3.35), we write 

Au\ = Au {I n )\ . (3.44) 


Here Au is an infinitesimal parameter of the Lorentz group [infinitesimal (gen- 
eralized) rotation angle] around an axis in the n direction. 0 n Y n is the 4 x 4 
matrix (in space and time) for a unit Lorentz rotation around the n axis. For the 
Lorentz rotation (3.37), which - as we already know - corresponds to a Lorentz 
transformation along the x axis with velocity Av = cAf. 3, we find 

a\ = 6" + Au>% = 6" + Au (l x ) u ^6^-Afi + 6^1) . (3.45) 


With Au = A(3, 

that only the matrix elements 
(/x)°l = (/x) 1 0 = -(/x)° I 



0 

-1 

0 

°\ 


-1 

0 

0 

0 


0 

0 

0 

0 


0 

0 

0 

0 / 


n V 10 

+ (7*) - 


(3.46) 


are different from zero. Besides, we can easily calculate the relations 


ft) 


2 


and 


ft) 


3 



0 

-1 

0 

On 


z 

0 

-1 

0 

°\ 


Z 1 

0 

0 

°\ 


1 

0 

0 

0 



-1 

0 

0 

0 


0 

1 

0 

0 


0 

0 

0 

0 



0 

0 

0 

0 


0 

0 

0 

0 


0 

0 

0 

0/ 


l 

0 

0 

0 

0/ 

Vo 

0 

0 

0/ 

z 1 

0 

0 

°\ 

( 

0 

-1 

0 

On 






0 

1 

0 

0 ' 


- 


0 

0 

0 






0 

0 

0 

0 


0 

0 

0 

0 

1 





\0 

0 

0 

0/ 

] \ 

0 

0 

0 

oy 

' 






0 

-1 

0 

0^ 












1 

0 

0 

0 



t 









0 

0 

0 

0 












0 

0 

0 

0/ 
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3.3 Finite Proper Lorentz Transformations 


The algebraic properties (3.47) for I x are of value, because we can use them, 
following (3.1) and (3.45), to construct the finite proper Lorentz transformations. 
Indeed we get by successive application of the infinitesimal Lorentz transformations 


( * T = (' + Ji 1 ’)" „ (' + Ji , *Y 

(” + W 1 *) 


. . . X 


VN 


— lim 

N —> oo 


v\ 
, N 


V2 


(The last summation index of the matrix multiplication has been renamed /j. in 
the last step.) 


= ('"'■) V" 

= (cosh (idl x ) 4- sinh(c uI x )Y x M 


„ , m *) 2 , M *) 4 , A , M , ( uix ) 3 , 

11 + ~ + — + ■) + [ir + + ••• 

id* ~ o Id* ~ o \ / Id ld J \ - 

1 + 2! (4r + 4[ (/ " r + --) + [u + 3\ + ' ]Ix 


J (1 


= [1 - dx) 2 + (cosh UJ){I X ) 2 + (sinha/)/*] 1 '^ 
or, explicitly, 


(3.48) 


f X n\ 

/ coshw 

— sinh u> 

0 

°\ 

/A 

x n 

1 -sinhcu 

coshw 

0 

0 


'x 1 1 

x 12 

“ 1 o 

0 

1 

0 


X 2 

Vx' 3 / 

V o 

0 

0 

1/ 


Vx 3 / 


(3.49a) 


or 


x'° = jt° cosh a; — jc 1 sinh a; = coshu/(je° — jc 1 tanhu/) 
x n = —x° sinh a; + x l cosh lj = cosh^(x ] — *° tanhu;) 


/2 2 
X — X 


x' 3 =x 3 


(3.49b) 


Now the finite Lorentz rotation angle uj can easily be related to the relative velocity 
v x — cp x of both inertial systems. Let us observe the origin of the x ! system and 
its motion from the x system, 

x n —0 — coshu/C* 1 — jc° tanhu/) . 

From this we can deduce the velocity of the x' system along the x axis of the x 
system. 

v x , , 

— = — = — = tann uj = p 
x° ct c 

and so by use of the relation cosh 2 cu - sinh 2 u = 1 we get 


(3.50a) 
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cosh lj = 


cosher 


1 


1 


\/cosh 2 lj 


sinh 2 lj y/\ — tanh 2 lj y/l—fP 
so that (3.49b) results in the known Lorentz transformations 




x° - (3x ] 

T^T 2 


X = 


X 1 -f3x° 


/2 2 
X U = JC Z 


/3 *3 

X = X~ 


(3.50b) 


(3.51) 


This procedure to construct Lorentz transformations can be generalized to motions 
in an arbitrary direction or to arbitrary spatial rotations. There exist six different 
matrices (generators) (In)" ^ corresponding to the six independent Lorentz trans- 
formations. They represent the four-dimensional genralization of the generators 
of spatial rotations known from nonrelativistic quantum mechanics. 6 We refer to 
Chap. 16 for a deeper discussion of the group theoretical structure of the Lorentz 
transformations. 


3.4 The £ Operator for Proper JLorentz Transformations 


Now we can construct the spinor transformation operator S (&) for a finite “rotation 
angle”. We start from the infinitesimal transformation (3.39) with the rotation angle 
(3.44) around the n axis and apply N of these infinitesimal transformations. 

N 


= e 


/V— >oo 

-(i/4 )wa lt AI n r'' 


1 Ul 


i>'(x') = S(a)ip(x) = lim 1 - ~» lu ,(l n r v ) 1>{x) 


4 N 


ip(x) 


(3.52) 


In particular for the rotation (3.46) which corresponds to a Lorentz transformation 
along the x axis we get 

ip'(x') = exp {-(i/4)w [W/x) 01 + v\oUx )'°] } ip(x) 

= exp{-(i/4)w [<3- 0 i(+l) + <7io(-l)]} ip(x) 

= exp [-(i/2)w6oi] i>{x) , (3.53) 

because following (3.42) we have aoi — — <Jio and following (3.46), (/, ) 01 = 1 and 
(7 X ) 01 = — 1. Analogously we can write for a rotation around the z axis of angle 
(p (to avoid confusion we use the notation i x ,I y ,I z for the Lorentz transformation 
into a moving system and h,h,h for spatial rotations, according to (3.28) and 
(3.44)), 


Au," = Av(h)\ 

where 


(h)\ = 


(° 

0 

0 


0 

0 

1 

0 

0 

-1 

0 

0 

u 

0 

0 

0/ 


(3.54) 


(3.55) 


6 See the discussion of angular momentum operators in W. Greiner, B. Muller: Quantum 
Mechanics - Symmetries , 2nd ed. (Springer, Berlin, Heidelberg 1994). 
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Thus only the elements (/3) 12 = —(h) 21 are non- zero, in which case we get 
= exp [-(\/A)ipa, lv {hT u ] 'i/’t*) 

= exp [-(i/4V {&n(h) 12 + hi (/ 3 ) 21 )] rKx) 

= exp [(i/4)v?(^i 2 (-l) + &21 (+1))] ip(x) 

= exp [(i/2)<£><7 12 ] ip(x) = exp [(i/2)v?a 12 ] rf)(x) , (3.56) 


corresponding to (3.52). Using the explicit representations (3.13) we obtain, ac- 
cording to (3.42), 


5.12 



- i - 2 

o'o“ 


A 2. " 1 

GO 


a1 a9 

< 7 *< 7 “ 


~ 9 xv 1 

< 7 “< 7 * 


2 f 2ia 3 0 N \_/<5' 3 

2 0 2ia 3 ) { 0 a 3 ) 


0 a x 
-a 1 0 


(3.57) 


Here a‘ are the well-known 2x2 Pauli matrices, in particular 


< 5-3 


1 0 
0 -1 


We recognize the similarly between the spinor transformation (3.56) and the 
rotation of the two-component Pauli spinor, 

<p'(x') = e m “- & cp(x) , (3.58) 

with which we are already acquainted. ; We can call the the covariant angular 
variables , because they arise in a similar way as did the components of the axial 
rotation vector u> for the spatial three-dimensional rotations. The existence of half- 
angles in the transformation law (3.56) is a result of the pecularity of the spinor- 
rotation laws. A spinor is first transformed into itself by a rotation of 4 tt, not 2 tt, as 
one might expect. Therefore, physically observable quantities have to be bilinear 
in the spinors ip(x) in spinor theory (Dirac theory), and hence they have to be of 
even order in the fields f ip(x). Only in this case do observables become identical 
under a rotation of 2n, a property of observables we know from experience. 

Following (3.52) and (3.56) the operator Sr for spatial rotations of spinors is 
given by 

S R (uJij) = exp [~(i/4)&ijU ,J ] , i,j = 1,2,3 . (3.59) 

Note that in the previous case 7 we subsequently used active rotations, whereas for 
the Lorentz transformations we have used passive rotations. It is unitary, because 
the (Jij (ij = 1,2,3) are Hermitian and 

7 See W. Greiner, B. Muller: Quantum Mechanics - Symmetries , 2nd ed. (Springer, Berlin, 
Heidelberg 1994). 
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Sr = exp (i/4)djw ,? ] = exp [(i/4 )diju ij ] = S R 1 . (3.60) 

For proper Lorentz transformations (e.g. transformation into a moving frame) this 
does not hold. For example, for a Lorentz transformation into a moving inertial 
system along the x axis (3.53) we find that 

S L = e -(i/2)wdo1 = e +(w/2)dl = S L + ^ S _1 L , (3.61) 

where we have followed from (3.42) 

1 1 A A A A 

<3oi = (7o7i - 7i7o) = ^ ~ Z^i/ 3 ) 

= ^(di +di) = iai . 

In Exercise 3.3 we shall prove that in this case 

S-'l = 1oSS (3.62) 

holds and further that, because of 

[7o,dy ]_=0 , 

we can combine (3.60) and (3.62). Hence, for any Lorentz transformation (i.e. for 
proper Lorentz transformations and for spatial rotations) we can write 

S~' = 705+70 ■ (3.63) 

exercise 

3.3 Calculation of the Inverse Spinor Transformation Operators 

Problem. Show that for S = exp (— the inverse operator is given 
by: 

5-' =70^+70 • ( 1 ) 

Solution. We prove (1) separately for transformations on a moving coordinate 
system and for spatial rotations. 

(i) Spatial rotations. For spatial rotations the time coordinate remains unchanged. 
So the components / 0/x , I 0 " of the corresponding generators vanish identically. 
Thus we can write 

S = exp , 

S ] = exp = exp 

because the 07 are Hermitian and 


( 2 ) 
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& l = 7/ ) + ~ C'yy'Tf) t } = “ {ijn - mi] = &ij • (3) 

According to (3.41) 70 commutes with 07 and thus 70 commutes with 5 t to 0. 
Hence we get 

705^0 = 5^=5-’ . (4) 


(ii) Transformation onto a moving coordinate system. First we rotate the coordinate 
system such that the boost coincides with the x direction. For this transformation 
we have 


5 - exp -o;<Toi J 

[see (3.61)] and 


= exp 

because 


exp ( --u;<7oi ) = S , 


°l\ = ~ \ {(7o7i) f ~ (7i7o) f } = ^(7i7o - 7o7i) = -001 • 


From 


700-qi = ^(7o7o7i - 7o7i 7o) = ^(7i7o7o ~ 7o7i7o) = 0-1070 = -00170 


(5) 


(6) 


(7) 


we get 

7 0 5 f 7o = 70 


n= 0 


S ( ■ "i^oi 


7o = X! 70 (“i^oi ) 7o 

n= 0 ' 


n= 0 y 


= 70 ( "^01 ) 7o7o ( "^ooi ) 7o • • - 7o ( “WOoi J 7o 


n -times 


w / i 

= ( +o w5 'oi ) = exp ( ^u& 0 i ) = 5 _1 


n=0 


( 8 ) 


Thus (1) holds for all Loren tz transformations S. 


Exercise 3.3. 


3.5 The Four- Current Density 

Next we prove the covariance of the continuity equation (2.16) and (2.17). Ac- 
cording to (2.18) the probability current density reads 

= { J°,j} = {o'0 t i/',o^ t o:i/)} = {cV^VM > 


i.e. 




(3.64) 
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This current density transforms under the Lorentz transformation (3.1) as 

;"V) = c^'VbVV’V) 

= cip\x)S^ 7°7 ,i 5 ip(x) 

— cip^(xyy°§~ lr y ,M Sip(x) [because of (3.63)] 

= ca M v ip^ (jc) 7°7 1/ ip{x) [because of (3.33)] 

= aV^x) (3.65) 


and is such recognized as a four -vector. This is the reason for calling j^(x) the 
four- current density. The continuity equation (2.17) can now be written in Lorentz- 
invariant form 


dj»(x) 

dx-‘ J 


(3.66) 


By this we have explicitly proven that the probability density 
j°(x) = cq(x) = cip ] ip (*) 


transforms like the time component of a (conserved) four- vector. Thus an invariant 
probability is guaranteed, because it holds for the Lorentz system of observer A 

J j°(x)d 3 x = 0 =► J j°(x ) db = 1 

and for observer B 

A [ j'\x') dV = 0 =» f j'\x')tfx' = 1 . 

For further considerations it is useful to introduce the short-hand notation 

^ = ^7° (3.67) 

for the combination 7^7°, which occurs very often, ip is called the adjoint spinor 
and is converted by Lorentz transformations as [using (3.63)] 

ip' {x') — ip'\ jc / ) 7° = (S)P(x)y 7 0 = ip^(x)S^ 7 0 

= ip\x)j°S- 1 = 0(jc)S _1 . (3.68) 


3.6 Biographical Notes 


LORENTZ, Hendrik Antoon, Dutch physicist, * 18.7.1853 in Amheim, {4.2.1928 in Haar- 
lem. Professor at Leiden, founded in 1895 the theory of electrons, with which he explained 
the Zeeman effect as well as the rotation of the plane of polarization of light in a mag- 
netic field. He gave, furthermore, a first explanation of the results of the Michelson-Morley 
experiment (L. contraction) and established the Lorentz transformation. Together with r. 
Zeeman he was awarded the Nobel prize in physics in 1902. 

FEYNMAN, Richard Phillips, * 11.5.1918 in New York, { 15.2.1988 in Pasadena, professor 
at the California Institute of Technology in Pasadena. F. developed the Feynman graphs for 
the mathematical treatment of quantum field theory. In 1965 he was awarded the Nobel 
prize in physics (together with J. Schwinger and S. Tomonaga) for the development of the 
theory of quantum electrodynamics. 



4. Spinors Under Spatial Reflection 


Next we explore the improper Lorentz transformations of spatial reflections, which 
are given by 

x' = -x , 

t' = f , (4.1) 

with a corresponding transformation matrix 
/I 0 0 0 

„ _ f 0 -1 0 0 

a » ~ l 0 0-1 0 

\0 0 0-1 

In this case the Dirac equation should be covariant too, because (4.1) is just a 
special case of the general Lorentz transformation (3.1). All of our considerations 
from Chap. 3 can therefore be used again here, except for those which are based 
on infinitesimal transformations. This is consequence of the fact that a spatial 
reflection cannot be generated by means of infinitesimal rotations acting on the 
identity element. Let us call the operator of the spinor transformation P (for parity). 
For S = P holds also the defining equation (3.34) which now reads with (4.2) as 

aP^ =P'fP~ x or 
a%a\^ = Pa a „'fP-' 

3 

v = 0 

^p-'^vp . (4.3) 

Notice that on the rhs there is no summation over a, because conventionally sum- 
mation is only defined over identical indices occuring simultaneously as subscript 
and superscript in an expression. Equation (4.3) has the simple solution 

P = e^7° , P- 1 = e _iv5 7° , (4.4) 

where for the time being <p is an unobservable arbitrary phase. It can e.g. be 
chosen in the following way. In analogy to the proper Lorentz transformations for 
which a rotation of 47r reproduces the original spinor, we postulate that four space 
inversions will reproduce the spinor, i.e. 

P 4 ip — ijj = e l4v ’( 7 °) 4 'i/> = e 14 ^ i 


= 9' 


flU 


(4.2) 


(4.5) 
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This implies that 
(e^) 4 = 1 
and thus 


e i<p — ±1, ±i . 

(4.6) 

The operator P given by (4.4) is unitary, 


P- ] =Q-^^ = P\ > 

(4.7) 

and, as will be seen shortly, also fulfills 


O 

*v 

II 

7 

(4.8) 


which is analogous to (3.63). We are now able to write down explicitly the trans- 
formation of the spinor under space inversion, [cf. (3.27)] 

= PMx) — e lv, 7 °'ip(x,t) 

= Pip(-x',t') . (4.9) 

In the nonrelativistic limit 


ip(x) = 


<t>{x) 

0 


and thus <j> is an eigenvector of P . For an electron at rest [see (2.71)], 

Pip (l) = e iv V (1) , 

Pip® = e 1< > 5 i// 2 ) ( 

Ptp^ = ; 

Pi) w = -e' v ip w . (4.10) 

Therefore the eigenfunctions of positive energy 7 (1) and y/ Ti have a P eigenvalue 
(“inertial parity”) opposite to that of the states with negative energy p y> and ip (4) . 

Another improper transformation is time reversal but since this is more compli- 
cated we shall deal with it later, in Sect. 12.5. The parity of spinors is also discussed 
in Example 9.3 and is further illustrated in Sect. 12.5. 



5. Bilinear Covariants of the Dirac Spinors 


There must exist 16 linearly independent 4x4 matrices, which we denote by 
( r n ) a (j . It turns out that one can construct 16 (a complete set) of these P n (n = 
1, . . . , 16) from the Dirac matrices and their products. We write 

(T) 1 ^ (4) 7m ’ (6) a>lU = ’ 

f P (T) i 7 V7V-75=7 5 , /J = 757m (5-0 

and verify step by step the postulated properties of the P n as well as some extra 
ones (also cf. Example 3.1). First we shall prove that in (5.1) there are indeed 16 
matrices. This is easily done by adding the values written in brackets below the 
symbols. The upper indices of the matrices (“S”, “V”, “T”, “P”, and “A”) have the 
meaning “scalar”, “vector”, “tensor”, “pseudovector”, and “axial vector”, and 
these specifications will become clear in the following. Furthermore it holds that: 

(a) For each P n holds (P n ) 2 — ± H. We shall prove this for some of the t n : 

(fs) 2 =i . = m 2 = i»,, . < 5 - 2 > 

due to the commutation relations (3.11), 

{tlvf = °lv = ~|( 7 m 7 * - 7 * 7 m ) 2 

= -\ilixlvltxlu - 7 m 7 * 7 * 7 m - 7 * 7 m 7 m 7 * + 7 * 7 m 7 * 7 m ) 

= -3 [ 2 5m*7m7* - (7m) 2 (7*) 2 - 2( 7 m) 2 (7*) 2 

T ^ 9 ~ (7*) (7m) ] 

= — 4 [23m* (7m 7* "f” 7*7 m) 4(7*) (7m) ] 

= [4 (3m*) 2 11 - 4 3**3mm n ] 

f 3 m M H foxppv 

| 0 for p, = ia In this case < 7 M „ = 0 . 

Similarly the conjecture is proven for the other matrices (5.1). 

(b) To each P n except P s there exists at least one corresponding P m with 

pn pm _ _pmpn 

This is easily proved explicitly, e.g. 

P V »--P V »P? = ~P?n if v + P 


(5.3) 
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since this corresponds to the commutation relation (3.11) 
pj . ivr r.v _ _ fw fvr 

1 /XIX * 1 pUS 1 /X 1 /X 1 /XIX > 

because for each /i, ^ we have 


i i 

OvlvIix = 2^71/ - + 27 m 5 mi/ + 7 M 7„7 M - l^g^) 

This shows that the P^'s correspond to the P^ v in the sense of relation (5.3), and 
vice versa, etc. For the other r n matrices one proves the relation (5.3) in a similar 
way. From (5.3) and (5.2) in particular, 

-±-p n _ pm pn pm p n 

and therefore calculating the trace yields [making use of tr(AB) = tr(BA)] 

± tr(f n ) = - tr (P n (P m f) = + tr (f"(f m ) 2 ) = 0 . (5.4) 

Here we have a remarkable result: All P n matrices except P s have a vanishing 
trace ; 

(c) For given P a and P b {a p b) there exists a P n p P s with 

papb =f n b pn > ( 5 . 5 ) 


so defining f£ b as a complex number. We check this property with some examples: 
For p p v we have 


pVpV _ 

1 u 1 v — in iv 


JnJu-lu 7 m 


= -iv = -i 


In this case the / factor is — i. As a second example we take a look at 


{ -ilodmn = -i 9nn^a for P = t , 

+17 oQnn = f° r p = o , 

p p,a,T for P p t p a , 

etc. It can easily be shown that (5.5) holds in each case. 

Now we show that the P n ' s of (5.1) are linearly independent. For this purpose 
we suppose the existence of the relation 


^a n P n =0 , (5.6) 


and by multiplication with P m p P s we get 

0 = S n a n tr (P n P m ) =a m tx ((. P m f ) + E n+m a n tr (C^) [cf. (5.5)] 
= a m tr {(P m f ) +S,^ m aJP m tr (f") 

= ±J [cf. (5.2)] =0 [cf. (5.4)] 

— zb 4(2 m 


(5.7a) 
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Thus a m = 0 for all m ^ S, and in the case of t m = t s 

° = tr (^a„f s r'’] =«s tr(f s ) 2 tr(f”) =0 , (5.7b) 

\ n / V v ^ n^S ^ v ^ 

=4 [cf. (5.1)] T =0 [cf. (5.4)] 

i.e. a s = 0 also. Therefore all coefficients a n of (5.6) must vanish and the linear 
independence of the t n ’s is proved. Hence every 4x4 matrix can be expressed 
by the t n ’s. 

We now turn to the behaviour of the bilinear expressions 
'(p(x)r'ip(x) (5.8) 

under Loren tz transformations. For this purpose we need 

7 m 75 + 7 s7 m = 0 , (5-9) 

which may be easily verified and leads immediately to 

[ 75 , ^ (isilulu - lulu) - ilu,lu - lulfjdls) = 0 • (5.10) 

According to the proper Lorentz transformations (3.52) the spinors are transformed 
by the operator 


S(a) = exp = exp (-^< 7 ^( 7 ,,)^ . (5.11) 

Together with (5.10) we get directly 

[5(«),7s]_=0 • (5-12) 

According to (4.9) the space inversion (improper Lorentz transformation) of the 
spinors is accomplished by the operator 

P = 7 ° . (5.13) 

For this operator, by virtue of (5.9), 

P 7 5 = - 75 F or (5.14a) 

[P, 7 5 ] + = 0 . (5.14b) 

Now we consider bilinear quantities in ip and ip and can easily calculate that, e.g. 

ip' {x')ip'(x') — ip'\x')i°ip\x') — ip\x)S^ / y°Sip(x) 

= ip^(x)i°S~'Sip(x) — ip^x^ipix) = ip(x)ip(x) . (5.15) 

Thus the bilinear expression ip(x)ip(x) has the same value in every Lorentz frame. 
We therefore call 

ip(x)ip(x) = ip(x)r s ip(x) (5.16) 

a scalar under Lorentz transformations (or simply Lorentz scalar ) and ip(x)ip(x) 
the scalar density. Similarly, in Exercise 5.1 we prove that 
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= ip(x)S l ^ 5 Sip{x) = det(a)ip(x)j 5 ip(x) (5.17a) 

is a pseudoscalar, 

ip\x')YY{x') = a 1 ' (5.17b) 

is a vector , 

'<p' (x'Yl 5 l v 4 >'(x') = det(a)a" ^(xyf 5 y fl 'ip(x) (5.17c) 

is a pseudovector , and 

ip'ix'^ipfx') = 0rkx)a afi ipOc) (5. 17d) 


is a tensor of second rank. The prefix “pseudo” in pseudoscalar indicates that this 
quantity transforms as a Lorentz scalar but reverses its sign under improper Lorentz 
transformations. The same holds for the pseudovector (5.17c). 


EXERCISE 


5.1 Transformation Properties of Some Bilinear Expressions of Dirac Spinors 

Problem. Investigate the transformation properties of the following bilinear ex- 
pressions consisting of Dirac spinors: 

(1) ipip , (2) i/ry 5 ip , (3) t/’T/xV’ , (4) ihsltiip , (5) ipo^ip . 

Solution, ip changes to Sip under Lorentz transformations and ip to ipS~ l (since 
= 7o5 -1 ). It holds that S~ l = a^ u 7 " and S 75 = 7 5 S det|a|. The latter 
relation is a useful summary of (5.12) and (5.14). Thus, 

( 1 ) Ipip ^fpS~ x Sip 

= 'ip'ijj (scalar) , 

(2) ipj 5 ip — ► ipS~ x isSip 

— i/'det |a| 5 _ 1 575'0 
= det |«|- 075 ^ (pseudoscalar) , 

(3) fp^ip -^fpS~ x ^Sip 

= a (l u ip*( v ip (vector) , 

(4) ip'fs'y^'ip — ► ipS~ l y 5 y^Sip 

= det \a | ipjsS ~ Xr y^Sip 
= det \a\a^ u ip^$Y ^ (pseudovector) , 

(5) 4>&^ip ^fps-'d^sip 

= ( 7 ^ 55 -1 7 ^ - 7 ^ 55 _1 7 ^) Sip 

= ^ {a ti e i e a u T Y - a" t j t e Y) 

= e a u T ip& eT ip (second-rank tensor) . 
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EXERCISE 

5.2 Majorana Representation of the Dirac Equation 
Problem. Show that there exist four 4 x 4 matrices A such that 

A*A> + r y r^ — , 

(i hP^d^ — moc ) tj} = 0 , 

i.e., the Dirac equation is real in this representation. Here = d/dx^. 
Solution. From the common 7 M matrices we know that they fulfill 


7o 

with 


o x = 


n o 
o -n 


0 1 

1 o 


7 ; 


0 < 7 , 

-&i 0 


,<?2 = 


0 -i 

1 0 


cr 3 = 


1 0 
0 -1 


and 


lulu + lulu = 2g»u , To = To 
We define 

' 0 <J 2 


7 , = - 7 ; 


A = ToT 2 


<72 0 


f, ■ ■( o ox 

r, =1707, = o 

f, . ./I 0 

7 2 = 170 = 7 o -1 

0 <73 

<73 0 


A = 17073 = i 


and recognize immediately that these A are purely imaginary matrices. On com- 
plex conjugation 

A* = 7 o * 7 2 * = -7072 = -A , 

A* = -iToTi* = "17071 = “A , 


A* = -170 


-A 


A* = -17073 = -A 


since only 72 is imaginary. Now we check the anticommutation relations: 
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Exercise 5.2. 


9oo = 1 — (A) = 7o727o72 
= ~(7o) 2 (72) 2 = 1 , 

2<?oi = 2^,0 = 0 = -i(727i + 7i 72) 
= ~i [(7 o) 2 727i + (7o) 2 7i72] 



= -1(70727071 +70717072) 





= A A + A A , 




2<?02 

= 2520 = 0 = i(72 - 72) = i(7o7o72 

+ 707270) 


= A2A) + A0A2 , 




2<7o3 

= 2530 = 0 = A0A3 + A3 A> 

as for 

501 

y 

011 

= -1 = (7o) 2 (7i) 2 = - 707 i 7 o 7 i 




= 07 o 7 i ) 2 = (A) 2 , 




2012 

= 2521 = 0 = AAi + A>A 

, as 

for 

920 

2 < 7 i 3 

= 2531 = 0 — A1A3 + A3 A] 

, as 

for 

9 \o 

922 

= -1 = (170) 2 = (A 2 ) 2 , 




2<723 

= 2532 = 0 = A2A3 + A3A2 

, as 

for 

920 

533 

— — 1 — (A 3 ) 2 , as for 

0ii • 




Thus we have proved all of the necessary properties of the P^ matrices. This P ^ 
representation of the Dirac algebra is called the Majorana representation. 


5.1 Biographical Notes 


MAJORANA, Ettore, *05.08.1906 in Catania, dropped out of sight in 1938, went to 
the classical secondary school of Catania until the final examination in 1923. Afterwards 
he studied engineering sciences in Rome until the beginning of the last year of studies. 
1928 transfer to the physics faculty (own desire) and in 1929 Ph.D. in theoretical physics 
at Fermi’s. Title of the thesis: “Quantum Theory of Radioactive Atomic Nuclei”. In the 
subsequent years free-lance collaborator at the Institute of Physics in Rome. In 1933 he 
went to Germany (Leipzig) for some years and worked with Heisenberg. This resulted 
in a publication on nuclear theory [Z. Phys. 82, 137 (1933)]. In 1937 he published “The 
Symmetric Theory of Electron and Positron” and four years after his disappearence the 
“Significance of Statistical Laws for Physics and Social Sciences” was published. 



6. Another Way of Constructing Solutions 
of the Free Dirac Equation: 

Construction by Lorentz Transformations 


Our considerations in the last chapters showed that the free Dirac equation exhibits 
all of the properties of covariance. Moreover positive-energy solutions possess the 
correct behaviour in the nonrelativistic limit. We consider now the solutions of the 
free Dirac equation with a new approach better suited to the covariant formulation 
and, later on, for the field-theoretical apllications. Let us return to the solutions 
(2.71) of the free electron in its rest frame. We denote them in a more compact 
form: 

V> r = u r (0)c-' £ ' (mcl/h)t , r — 1,2, 3, 4 , (6.1) 


where 


( +1 for r = 1,2 
{ -1 for 3, 4 . 


( 6 . 2 ) 


The x dependence of the spinors ip r in (6.1) reduces to a simple time dependence. 
There is no space dependence because the wavefunction is smeared out homoge- 
neously in the whole of space. We also have 


cA0) = 


f l \ 

0 

0 

Vo/ 


u, 2 (0) 


co\0) = 


/°\ 

0 

1 

Vo/ 


u\0) 


f°\ 

0 

0 

Vi / 


(6.3) 


The first and the second solutions ip l (x) and ip 2 (x) have positive energy and cor- 
respond to the spin degrees of freedom of the Schrodinger-Pauli electron. For the 
solutions iJj 3 (x) and ip 4 (x) with negative energy we must still find a reasonable 
interpretation. All of the ’ip r (x) of (6.1) are also eigenfunctions of 


£3 = £12 = 


<73 0 

0 <73 


(6.4) 


[cf. (3.57)] with the eigenvalues ±1: 

E^ r {x) = {±\W{x) , 


(6.5) 


where the eigenvalues 

+ 1 are valid for r — 1, 3 and — 1 are valid for r =2, 4 

Earlier [cf. (2.34)] we directly obtained the free solutions for finite momentum 
(finite velocity) by solving the free Dirac equation. Now we follow a different 
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Fig. 6.1. In the x't' system 
the electron moves with the 
velocity +v. In the xt system 
the four-momentum is p^\ in 
the xt' system it is p'^ 


P°A light cone 



Fig. 6.2. The four-momentum 
of a free particle always lies 
within the light cone in mo- 
mentum space 


path: By transforming to a coordinate system which moves with the velocity - v 
relative to the rest system , the free wave functions of the electron with velocity +v 
are constructed from the wave functions (6A) of the electron at rest (see Fig. 6.1). 
For this purpose we write first the exponent of the rest solution (6.1) in invariant 
form: 

e -i 6r(m 0 c 2 /h)t _ e -i e r ip^x h h) _ ^-ierip^x'^h) 




— V 


rest system 


where 


x 9 ' L = a' k u x v 


\ electron at rest 


p ,fl = a^ v p u = a^oP 0 = a^omoc 


P " = {m 0 c,0} = 


(6.7) 


The primed quantities are valid in the moving reference system, the unprimed in the 
rest system of the electron. The zero-component of the four-momentum is always 
given by 

P° = ~ = +\/ p 2 + m V 2 > 0 • ( 6 . 8 ) 

Next we remark that the solutions for positive and negative energy transform sep- 
arately under proper Lorentz transformations and also under space inversion. This 
means that the solutions for positive and negative energy are not mixed under 
these transformations. One recognizes this from (6.6): The four-momentum of the 
free particle = mfic 2 > 0 is always timelike. Therefore, for free particles 

p M lies within the light cone in momentum space (Fig. 6.2). Under proper Lorentz 
transformation and also under space inversion (but not under time inversion), the 
future and past cones (i.e. the vectors with p° > 0 and p° < 0, respectively), and 
with them the solutions for positive and negative energy, remain strictly separated 
from each other. 

We transform the spinors w r according to (3.61) with the operator 

S = q-^oi/2 ( 6 . 9 ) 

if, for convenience, we choose the velocity parallel to the x axis. According to 
(3.50a), 

— = tanh uj . 
c 

Since in our case v x should be negative (the moving inertial system has the velocity 
v = — v x e\ in the rest system), this relation is now 




6. Solutions of the Free Dirac Equation: Construction by Lorentz Transformations 


159 


v x 

— tanh l o 

c 


u = tanh" 1 =- tanh- 1 . 

Therefore (3.49b) and (3.51) now read as 

x° = coshw (jc° — tanhwjc 1 ) = . (x° + — jr 1 ) 

x l = coshw (x l — tanhwx 0 ) = — . (x l 

v ' J\-PP. \ 


(6.10a) 

(6.10b) 


x „o 


V 

+ — * 
c 


( 6 . 11 ) 


and the operator (6.9), with &o\ 
— i 7 ° 7 °Q!i = — i&i, becomes 
c __ p -i(cj/2)<3- 0 i _ ~-(lj/2)6h 


\ (7o7i ~ 7i7o) = i7o7i = -i7°7‘ = 


(-w/2)& 1 | (-w/2 ) 2 (gQ 2 | (-w/2 ) 3 (dQ 3 | 


1! 


= | 1 , (u/Z) 2 | (u/2) 4 | 


2 ! 


2! 


4! 


3! 

„ . w/2 (w/2) 3 

i- 1 -b- + + 


1! 


3! 


ai 


u 


11 cosh — ai sinh — 
2 2 


( 6 . 12 ) 


Here we used aj = 1, according to (2.8). Now the spinor transformation (3.53) 
can be applied directly to the u/(0) from (6.1) to obtain 


oj r {p x ) = e -UtJ/z,CT01 o/(0) = 


-i(u)/2)& 0]fiJ r 

( 


u 


„ „ w\ 

cosh — — «i sinh — J u/(0) 


= cosh 


u> 


1 
0 
0 

V - tanh 


0 

1 

• tanh j 
0 


0 

tanh j 
1 
0 


■ tanh f \ 
0 
0 

1 ) 


w'(0) . (6.13) 


Because the w r (0) have the simple form (6.3), the rth column of this transfor- 
mation matrix (6.13) is identical to the spinor u r (p). More precisely, 


/ 


UJ 


1 IX/ 

U (p x ) = cosh- 


1 

0 

0 


\ 


UJ 


u ( p x ) = cosh - 


UJ 


UJ ( p x ) = cosh- 


V — tanh j J 

0 \ 
-tanh j 

1 

0 J 


0 

1 

tanh j 


tanh j 


ui 


u (p x ) = cosh - 


0 

0 

1 


(6.14) 


To return to physical quantities, we convert the rotation angle w with the aid of 

( 6 . 10 ), 

cosh x sinh x = ^sinh2x , 
cosh* cosh* = |(cosh2x + 1) , 

sinhxsinhx = ^(cosh2x — 1) , 


(6.15) 



160 


6. Solutions of the Free Dirac Equation: Construction by Lorentz Transformations 


and therefore 
tanhx = 

Since 
sinhx 


tanh 2x 


sinh 2x 

cosh 2x + 1 1 + 1 / cosh 2x 


= tanhx — 


\J cosh 2 x — 1 


coshx coshx 

and hence (6.16) becomes 

t tanh 2x 

tanhx = 


then coshx 


1 + \/\ - tanh 2 2x 
With (6.10) we may now calculate 


tanh u 


Vx/C 


(jO 

■ tanh — = — _ — — 

^ 1 + yl- tanh 2 u 1 + \/l — ( v x /c ) 2 


PxC 


and 


U) 

cosh - = 
2 


[/MqC 2 /a/1 - (v x /c) 2 ](v x /c) _ 

[m 0 c 2 /y 1 - {v x /c) 2 } [1 + Vl - (v x /m ~ E + m 0 c 2 
1 


y 1 — [tanh 2 u>/(l + \/ 1 — tanh 2 u>) 2 ] 
1 

\/l - {(v x /c) 2 /( 1 + - ( V x /c ) 2 ) 2 ] 

i + yi - ( vx/c p 

\J 1 + 2 \/l - (V x /c) 2 + (1 - (v x /c) 2 ) - (v x /c) 2 

1 + V 7 ! - fe/c) 2 

v^l - fe/c ) 2 + V 7 !- (Vt/c ) 2 
[(l/v 7 ! ~ ( Vx/c ) 2 ) + l]m 0 c 2 
v^i + [l/v 71 - me 2 

E + moc 2 


E + moc 2 


(6.16) 


V 7 ! — tanh 2 x 


(6.17) 


(6.18) 


(6.19) 


y/m 0 c 2 + E y/2m 0 c 2 V 2m ° c2 

Hence by (6.18) and (6.19), the transformation (6.13) can be expressed completely 
by physical observables (E, p x , m^c 2 ) and reads 


U r (p x ) - 

( 


jE 4- m 0 c 2 
IrtlQC 2 

1 

0 


0 

PxC 


0 

1 

PxC 


0 

PxC 


PxC 


E + moc 2 


E + rtiQC 2 
0 


E + mac 2 
1 


E + rriQC 2 
0 


0 


0 

1 


u/(0) . (6.20) 
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The result (6.20) can be generalized to plane waves with velocity v in arbitrary 
directions. We write 


v = vn = v {cos a, cos /3, cos 7} ; 


( 6 . 21 ) 


thus n is the unit vector which specifies the direction of motion of the moving 
inertial system with respect to the rest system of the electron. Then according to 
(3.44) the “rotation angles” are 


Au% = Au{I n )\ = Au 


0 — cos a 

— cos a 0 

— cos /3 0 

— cos 7 0 


cos /3 — cos 7 

0 0 

0 0 

0 0 


( 6 . 22 ) 


^ H 1 — - C OS/3 0 0 0 ' 

V — cos 7 0 0 0 / 

and with that, according to (3.45), the infinitesimal Lorentz transformation reads 
x /l/ = = x" + Au%x» 1 (6.23) 

or, in expanded form, 

x f0 = x° — Au cos ax 1 — Au cos (3x 2 — Au cos 7X 3 , 

x n = — Au cos ax 0 -f jc 1 , 

x a = — Au cos px° + x 2 , 

x' 3 = —Au cos ^x° + x 3 . (6.24) 

Since x f = 0 = {jc /] , jc / 2 , jc / 3 } characterizes the origin of the moving inertial 
system, it follows immediately that it moves with 


Av = {Av\, Av 2 , Avt,} = c 


jc 1 jc 2 jc 3 


JC°’ JC°’ JC° 


= cAu {cos a, cos /?, cos 7} (6.25) 


relative to the rest coordinate system. From (6.24) easily follows the former special 
case of motion along the x axis if we set cos a = 1, cos/3 = 0, cos 7 = 0. Just 
as simply, one obtains the movements along the y and z axes for cos a = 0, 
cos/3 = 1, cos 7 = 0 and cos a = 0, cos/3 = 0, cos 7 = 1, respectively, both of 
which are also contained in (6.24). Thus from (6.22) follows the coefficient matrix 
belonging to the direction of motion n. 

/ 0 — cos a — cos (3 — cos 7 \ 

f f -cos a 0 0 0 | 

(/ ” ) ' = -cos/3 0 0 0 ( ’ 

V - cos 7 0 0 0 / 


(6.26) 


Setting Alj = u/N we get, in complete analogy to (3.52), 


ip\x') = S(a)xp(x) 




= exp (— iwa M „(/n) M1/ / 4) ip(x) ; 


(6.27a) 


S(a) = exp (-iw^(/ n r74) 


(6.27b) 
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The expression in the exponent reads 

0^(lnT V = 2 (<7oi (Inf 1 + <M/n)° 2 + ^Onf ) , 

which can be converted by use of (6.26) and g n — g 22 = g 33 = — 1 into 

a y 

— 2(<J 0 1 cos a +' <j 02 cos/? + <J 0 3 cos 7) = -2i& • - . (6.28) 

V 

Here we have made use of 
v 

— = {cos a, cos (3, cos 7} 
along with v = |v| and 

£0; = ^[7o,7/]- = ^(7o 7/ - 7/7o) = 1707/ 

= i7°7 l 9u = -i7°7' = -*7° 7% = -i &i ■ (6.29) 


With this the spinor transformation (6.27b) for Lorentz transformations to inertial 
systems with direction of velocity v/v now becomes 


S(-v) = S (-—) = 


1 


I E + m 0 c 2 
Itn^c 2 


0 

PzC 

E + moc 2 
P+c 

L E + moc 2 


0 

1 

P-c 

E + moc 2 
~Pz c 
E + moc 2 


PzC 

E + moc 2 
P+c 

E + moc 2 

1 

0 


P-c -| 

E + moc 2 

~PzC 

E + moc 2 
0 

1 


= [u'(p),0J 2 (p),u 3 (p),u 4 (p)] 


(6.30) 


Here we have set p± = p x ± i p y , and the last line indicates that the individual 
column vectors of the S(-v) matrix are identical with u/(p). We will calculate the 
final step of (6.30) in Exercise 6.1. 


EXERCISE 


6.1 Calculation of the Spinor Transformation Operator in Matrix Form 

Problem. Calculate explicitly the spinor transformation operator 

S(~V) = Q-^I 2 ^-Vlv 


in matrix form. 


Solution. We expand S in a series 

c { \ . id & - V 1 U 2 2 lw 3 ,. , 


( 1 ) 


and use 
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(a • v) 2 = a 1 cJ ViVj — 7 ° 7 * 7°7 ^ ViVj — —7 viVj 

= ( iiviVj + iivjVi) = -\ {2g ,J lviVj) 

= Wi . 


( 2 ) 


This yields 

~ ( lw 2 (v\ 2 1 w 4 / v \ 4 

S(-®) = h + — (-) + S1 g(-) + 


& • v ( u la; 3 fv\ 2 
V {2^68 \ v/ + 

„ t U a-V.U ) 

— H cosh — sinh — . 

2 v 2 


( 3 ) 


The matrix & -v/v has the form 
& • v 


= 


v x . « v y . . 

h Qv • h Oi 7 



. l- 


. . -1- 

Ei 

. . l . 

+ \Py 

. 1 

p 

l . . 

P 

. -1 . 


.1 . . .. 


.1 




1 

p 


-1 


Pi p- 
P+ ~Pz 


Pi p- 
Lp+ -Pi 


( 4 ) 


where p± = p x ± i p y . 

With the help of (6.19) for cosh(w/2), we obtain from (3) that 


S(-v) = 


I E + m 0 c 2 
2rtioc 2 


1 


IE + mac 2 tanhu>/2 
2moc 2 p 


( 1 

0 
PlC 


I E 4 - rriQC 2 
2moc 2 


lJ 


Pi P- 
L P+ ~Pz 

0 


P~c 


Pi P- 
P+ ~Pi 


PlC 


P~c 


E + moc 2 E + moc 2 

P+c ~PzC 

E + moc 2 E 4- moc 2 


E 4- ttiQC 2 E 4- moc 2 

P+C PlC 

' E + moc 2 E 4- niQC 2 


1 

0 


0 

1 


( 5 ) 


Exercise 6.1. 
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Exercise 6.1. 


According to (6.18), 



pc 

E + ntQC 2 


( 6 ) 


taking into account that in (6.18) p was substituted by p x , because there we only 
considered motion in the x direction. Hence the matrices in (5) can be summarized 
by the expression (6.30). 


EXERCISE 


6.2 Calculation of the Spinor u(p + q) by Means of a Lorentz Transformation 
from the Spinor u(p ) of a Free Particle 

Problem. Given a free-particle spinor u(p), express u(p + q) by u(p) for — » 0, 

pq — * 0 by means of a Lorentz transformation. 


Solution. The solution of the Dirac equation for a free particle with momentum p 
reads as 


ip(x) = u(p)e ip ' x . 


( 1 ) 


If we transform into a system x' , which moves with the velocity u = -q/m in 
the system chosen in (1), the particle (in system x') has the momentum p + q, and 


</>'(*') = u(p + q)e- i( P + ^- x ' . 

Because of the invariance of the scalar product p 


e ~i P’X _ e -i(p+q)-x’ 


From (3.52) we obtain the transformation of into the moving system 


?//(x') = exp [ . 


By inserting (1) and (2) into (4) we get the general result 


u(p + q) = exp ( ) u(p) . 


For — > 0 the exponential function can be expanded; 


exp ( = 1 - 


( 2 ) 


(3) 


(4) 


(5) 


( 6 ) 


If we choose the coordinate system in (1) with q pointing in the x direction and 
with pq = 0, (6) can be simplified as 

f i \ ^ iuj 

exp I --coaoi 1 ^1- —a 0 \ , 


( 7 ) 
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where the connection between u and q — ( 0 , q x , 0 , 0 ) is 


, v Qx 

tanh uj = - = ~ u) 

c me 


Therefore one obtains 


u (p + q)= + 

=(* + 


=<i+ 


1 q x 

2 me 
Jh_ 
4mc 

qx_ 
2 me 


coi J u(p) 

(7i7o - 7o7i)) «(P) 

. -1 


-1 


u(p ) 


Ul(p ) - 
U 2 (p) - 
« 3 (p) - 

\u 4 (p) - 


Jh_ 
2 me 
Jh_ 
2 me 
Vx 

2 me 
qx 

2 me 


u 4 (p)\ 

«3 0 ) 

«2(P) 

Ui(p) J 


( 8 ) 


Exercise 6.2. 


( 9 ) 


6.2 The General Form of the Free Solutions and Their Properties 

From our previous considerations (6.1), ( 6 . 6 ) and (6.30), the general free solution 
must have the form 

1p r ( X ) = U/(P) ( 6 31 ) 

In the above, due to (6.3) and (6.30), the spinor u> r (p) is identical to the row r of 
the matrix (6.30), because 

u/(p) = S u/( 0 ) . (6.32) 

However, we must take note of the fact that this result holds for the special repre- 
sentation of the 7 matrices (3.13) only. The spinors uj r (p ) satisfy 

(^ - e r m 0 c)u> r (p) = 0 , (6.33a) 

u r {p)(ij - £ r m 0 c ) = 0 , (6.33b) 

which are very useful later on. The first one is obvious and directly follows with 
(6.1) from (3.9) and the covariance of the Dirac equation. It represents the Dirac 
equation of a free particle in momentum space. We have e r = +1 for r = 1,2 
and therefore (6.33a) reads (j l — m$c)u r (p) = 0. These are the solutions with 
positive energy. As already mentioned u l (p) and u 2 (p) are given by the two first 
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rows of (6.30). Furthermore, we see directly that in the nonrelativistic limit of 
the representation of (6.30) the lower components of i o ] (p) and u> 2 (p) become 
small; in the limit of the electron being at rest they finally change to the first two 
solutions (6.3). Similar considerations apply for the solutions of negative energy 
C £ r = — 1. r = 3,4). According to (6.30) and (6.3) we have to exchange the large 
components with the small ones in this case. Equation (6.33b) follows from (6.33a) 
by Hermitian conjugation, 

[(j/ - £ r m 0 c)uj r (p )] * = 0 = u r \p) ( 2 / - £ r m 0 cy 

= u/ f (p) (j/ t - £ r moc ) 

= u r \p) (/v 7 Mt - £ r m 0 c) 

= w rt (p) (po7° - Pkl k ~ £ r m 0 c ) . 

Note that the p ^ are real numbers (pj, = p,,), and in the last line (3.12a) was used, 
i.e. 7°t = 7 0 and 7^ H — 7*. Multiplication with 7 0 from the right yields 

u r Hp) (Pvl 7 Mt - s r moc) 7o = 0 = u rf (p) 7 0 (p 0 j° + p k ^ k - £ r m 0 c) 

= u r (p) (j p m 7 m - £ r m 0 c ) 

= u r (p) {jf - £ r m 0 c ) . 

We name (6.33b) the adjoint wave equation because it is valid for the adjoint 
spinor u r (p) = co r Up)jn- With the help of the explicit representation of the u r (p) 
in (6.30) we can now compute the normalization condition (see Exercise 6.3), 

U r (p)u) r '(p) = 6 rr '£ r . (6.34) 


Then, in Exercise 6.4, we will prove the validity of 


u) r He r p)0J r '(£r'P) 


E 

ntQC 2 


(6.35) 


EXERCISE 


6.3 Normalization of the Spinor u/(j>) 


Problem. Prove explicitly the normalization condition 
u r (p)u r \p ) = 6 rr >£ r 
for the spinors given in (6.30). 

Solution. From these equations we get 

l 1 \ 

0 

PzC 

E + m^c 2 
P+c 

\E + moc 2 ) 


E + moc 2 
2moc 2 


w'(p) = 
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u) 2 (p) 


u\p) 


u 4 (p) 


° \ 

1 

P-C 

E + moc 2 

—Pz c 

' E + moc 2 ' 

P' C \ 

E + moc 2 

P+c 

E + moc 2 

1 

\ 0 / 

/ P ~ C \ 

E + moc 2 ' 

~PzC 
E + moc 2 
0 

1 / 


£ 4- moc 2 
2moc 2 


£ + moc 2 
2moc 2 ’ 


£ 4 - rriQC 2 
2moC 2 


Now we take, say, the product u l ^(p)u l (p). Note, that = p-\ 

( 1 \ 

p z c p-C \ E + moc 1 


io\p)u\p) = ( 1,0, 


E + moc 2 ’ E + moc 2 J 2moc 2 
E 4- moc 2 


0 

Pz c 


E + moc 2 

p+c 

V E + m^c 2 / 


2m<)C 2 

1 f 


{ 


2„2 


1 - 




P+P-C 


(E + m 0 c 2 ) 2 (£ 4- moc 2 ) 2 

2 't 


2moc 2 
1 


2 (Px+Py+Pl)^ 
E + m$c — 


(E + m 0 c 2 ) 


+ & + 1Em " C 2 + ^ ^ 

2^£W) <W+2m » V > 

2{E 4- moc 2 )m 0 c 2 
2moc 2 (E + mac 2 ) 


= 1 = j <Si 


Next we calculate 


/ 


ft 


_2, v 3/ * E + m » c2 /'n i P+ c ■ P zC 

u r(p)ur(p) — — : — I 0. 1, -- 


2rri()C l 


i +■ 


E + moc 2 ’ E 4- moc 2 


p+c 


E + moc 2 
p+c 


\ 


E + moc 2 

V 0 J 


_ E + m c 2 J p+c . . 

2moc 2 \ E 4- moc 2 E 4- moc 2 

= 0 = 6 23 £2 • 


Exercise 6.3. 
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Exercise 6.3. 


The third example we give is 

— 4/ \ 4/ \ E + moC 2 
uT(p)oj q (p) = 

z rriQC - 

1 


p+c 


-PlC 


r , 0, -1 


/— p - C \ 
' E + m 0 c 2 ' 

~Pz c 


2moc 2 \E + moc 2 ’ E + mac 2 

{ (Px +Py +Pz ) c 2 -(E + m 0 c 2 ) 2 } 


E + moc 2 
0 
1 


2moc 2 (E + moc 2 ) 
- 1 

2moc 2 (E + m 0 c 2 ) 
— — 1 = <54464 


{p 2 c 2 - E 2 - 2 Em$c 2 — mlc 2 } 


All other combinations can be calculated in the same way. However, there is a 
more elegant procedure available: u r (p)u r '(p) is a Lorentz scalar (cf. Exercise 
5.1), and hence 

uJ r (p)u r '(p) = uJ r (0)u/(0) = w rt (0) 7 V'(0) = 8 rr ,e r , 

which we see at once because of (6.3). But note: The probability density u> rt (p)u r ip) 
is not Lorentz invariant. This quantity is only the fourth component of the four- 
vector. 


EXERCISE 

6.4 Proof of the Relation u) r ^(e r p)u) r ‘ {e r ’p) = 6 rr >(E /m^c 2 ) 

Problem. Show with the help of (6.30) that the following relation holds: 

, , e 

u n (£ r p)u r (e r ,p) = 8 rr > =• . 

m 0 c 2 

Solution. Again we calculate some examples 
(a) r = 1, r' = 1: 



PzC 

E + m 0 c 2 ’ 


E 4 - moc 2 
2moc 2 


p-c 


c. -v m$c 


E + moc 2 J 2moc 2 

[ (Px + Py + Pz) c‘ 
I (E + m 0 c 2 ) 2 


E + m 0 c 2 
p + c 

^ E 4- moc 2 ) 


E + m 0 c 2 ((£ + m 0 c 2 ) 2 + p 2 c 2 ) 
2 moc 2 (E + moc 2 ) 2 

2 E 2 4- 2moc 2 E E 

2moc 2 (E + moc 2 ) n%c 2 11 


I 


6.2 The General Form of the Free Solutions and Their Properties 


169 


(b) r = 2, r' = 3: 


0 , 1 ,+ 


p+c 


p z c \ E + moc 2 


E + moc 2 ’ E + m^c 1 ) 2mac 2 
E + moc 2 


( 

' E + rriQC 2 ' 

P+g 

£ + moc 2 
1 

V o / 


2moc 2 
(c) r = 4, r' = 4: 


P+c P+c 


£ + moc 2 E + moc 


}■ 


0 




P+ c PzC )£+m 0 C 2 

? I , 9 ? A 


E+moc 2 ' E+mac 2, " , 'J 2mac 2 
E + mac 2 f {pi +pl +pl) c 2 


+ 


£ 4- moc 2 
PzC 


E + mac 2 
0 

V 1 / 


2m<)C 2 


+ 1 


1 


2moc 2 (E + moc 2 ) 

E , 

-044 . 


(£ + m 0 c 2 ) 2 

/ 

{p 2 c 2 + E 2 + 2Emoc 2 + mgc 4 } 


moc z 


The remaining combinations can be calculated similarly. 


Exercise 6.4. 


The factor 

E moc 2 1 1 

mac 2 ~ yj\ _ @2 mac 2 ~ _ p 

appearing in (6.35) just cancels the Lorentz contraction of the volume element in 
the direction of motion 

AV' = Ax'Ay'Az' = Ax y/\ - (3 2 AyAz = AVy / 1 - p 2 . 


Consequently the probability in the volume AV' becomes invariant, i.e. 
u} r '*(p)u r ' (p)AV' = u> rt (0)ci/(0)4\V 

= 1 y/\ - (3 2 AV = 1 .AV . 

V^T 2 


(6.36) 


We expect this property to hold for a proper normalization. Let us clearly point out 
the difference between the orthogonalization relations (6.34) and (6.35): In (6.34) 
the spinor u r (p) is orthogonal to the adjoint spinor u r (p) with the same momentum 
argument. On the other hand, for instance in (6.35), the spinor u r \e r ip) with 
positive energy (r' = 1, 2) is orthogonal to the Hermitian conjugate spinor u r f (£rP) 
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(r = 3,4) belonging to negative energy and the reverse momentum argument . 
When considering the latter property we have to bear in mind that the sign of 
the momentum dependent term in the plane wave factor is also reversed for the 
negative energy solutions. Therefore two plane Dirac waves with the same spatial 
momentum p but opposite energy are orthogonal. This follows simply from the 
definition of the plane Dirac waves given in (6.6), (6.13) and (6.33) [compare also 
with (6.31)], 

V£(x) = u) r (p)e~ l£ ' p ^ /h = co r (p)e- i£rPaxn/h e +i£ ^- x/h , 

Po = +}/p 2 + m£c 2 > 0 . 

Accordingly, we have, for a wave with r = 1, 2, the energy E = + \Jp 2 + mfic 2 = 
Pqc and momentum +p, 

^ p ’ 2 (x) = uj l ’ 2 (p)e- ipox0/h e ,p - x/h = u x ' 2 (p)t- l P^/ h , (6.37a) 

and for one with r = 3, 4, the energy E — —\Jp 2 + mfic 2 = —poc and momentum 
+p 

= u 2A (-p)c +ipoX ° /h c ip - x/h = u 3 ' 4 (-p)e +ip ° x<> / h e- i( - p - x / h) . (6.37b) 

Clearly we learn that both waves have the same momentum p (the same factor 
e i P' x / h ) but the opposite energy (e _1/7()X ! h and e+ ip ° x / h , respectively). Hence the 
spinors cu 1,2 (p) belong to positive energy and momentum +p, whereas u; 3,4 (— p) 
correspond to negative energy and the same momentum +p. These spinors are 
orthogonal according to (6.35), and thus (6.37) holds. In fact, the relation (6.35) 
is required to ensure that any two plane waves i> r p ,(x) and are orthogonal in 

the sense of the following scalar product: 

(V’p'IV'p) = J d 3 x Vy^V’pC*) • (6.38) 

Inserting the plane wave spinors of (6.31) we obtain 

k¥ v ’W v ) = J d 3 X u r '^(p')u r (p) exp [-i (erp^/h - £ r 'P^X p /h)] 

= u r ' ^ (p')u r (p) exp [-i (e r 'p' 0 x° /h - £ r p 0 x°x 0 /h )] 

x (27 t) 3 (!> 3 (e rl p'/h - £ r p/h) . (6.39) 

The integration over the spatial coordinates has led to a delta function containing 
the momentum vectors p and p 1 . Thus the scalar product vanishes unless p' = +p 
(if the energies have equal sign) or unless p' = —p (for opposite signs of the 
energy, e r < ^ —£ r )- This is just the condition for which the orthogonality relation 
of the unit spinors uj r (6.35) applies! Thus we obtain for any two plane waves 

(V'p'lV'p) = {&' ~ P)/ h ) 6rr ‘ ■ ( 6 - 4 °) 

Up to a normalization factor this is just the orthogonality property one would expect 
for plane wave states. 
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The closure relation is important too: 

4 

^£rV r a (p)w r l} (p) = 6 a p . (6.41) 

r~\ 

Here the sum extends over all four spinors, taking from the first one only the 
component a and from the second adjoint one solely the component (3. Therefore, 
the closure relation (6.39) expresses a kind of row orthonormality of the matrix 
(6.30). Due to (6.13), in the rest frame (p = 0) of the electron clearly 

4 

^£,<(0)^(0 ) = 6 a0 (6.42) 

r = 1 


is valid. For this reason (6.41) can be traced back to (6.42). Indeed with the help 
of (6.32) we can calculate it directly: 

4 4 

y><(p)^(p) = 5] eXy(-§)v!y(0m0)S x -J (-|) 

r = l r = 1,7, A 

= (-f ) (-§ ) 

-EM-f) *«'(-! W 

7, A 

= E^«(-|)^' (-|)=' s «« • < 643 > 

7 

where in the first line we have used the relation 

— 7°5 ” 1 7 0 (6.44) 


from (3.63). This is again the reason why the adjoint spinor Zu r = 0/^70 appears 
in the closure relation (6.41), but not the Hermitian conjugate u/t . It is in accor- 
dance with the fact that the Lorentz transformation of the spinors is not unitary as 
expressed in (6.44). 


exercise 

6.5 Independence of the Closure Relation from the Representation 
of the Dirac Spinors 

Problem. Show that the relation (6.41) 

4 

J] £ ^a(PH(P) = <W 

r = 1 

is independent of the special representation from the Dirac spinors. 
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Exercise 6.5. 


Solution. Let another representation of the Dirac algebra (£/t = 

U -1 ) be given, and let v r (0) denote the spinors of the free particles at rest, con- 
structed accordingly. We will prove that from (6.41) it follows that 

'E / e r v r a (p)v r (3 (p) = 6 a p . (1) 

r 

As seen previously [see (6.43)], one can show that (1) is equivalent to 

E 6 ^ 0 )v r p(O) = 6 a 0 , ( 2 ) 

r 

and therefore we prove (2). 

With the change of the representation = U-y^U - ', the spinors become 
v r (Q) = Ouf( 0). Hence we have 

52, £rV ^% = 'E,£r(Uuj r ) a (Uu} r )P = J2 £ rU ay LO r i ((Ou^To) p 

r r r ,7 

= E £ ^x = E (u- 1 ) 60 

r >7 r,7,<5 

^ ^ ^ ^ 3 'ySUoi'y(.U 

r, 7 , <5 7,5 

= » 

<5 

which is the form required. 


EXERCISE 


6.6 Proof of Another Closure Relation for Dirac Spinors 
Problem. Prove directly the closure relation 

4 ^ 


E W a( £ rP)^(£rP) = 


r — 1 


m$c 


2 $ ol(3 


Solution. We use the equation 
U r (£ r p) = S w'(0) , 

insert the expression for S given by 

i(_£^) = Icosh ^_£i^£ sinh ^ 


2 

U 


Ip! 2 

0/P. W 

P 


Hcosh^-y £r ^sinh^ 

2 4^ p 2 


(1) 


(2) 


( 3 ) 


and use this to transform (1) into 


6.2 The General Form of the Free Solutions and Their Properties 


173 


A - y ^ u r a (n r p)u>a(e,p) 


= E 

r, 7,$ 


cosh S ay - Y £r( ' a, ' laiP ' sinh ^ J u>'( 0) 


0^(0) fcosh(D<5 5/3 -^ 


£ r(&])S0Pi sigh — 

P 2 


(4) 


The matrices a, are Hermitian [&J = a,] and by explicit multiplication we get 


A = cosh 2 j £>'(0)u;?(0) 


— cosh — sinh — 

2 2 

r,i,7,o 

+ sinh 2 ^ Y ^y(O)cp r s f (O)^ 1 Y'(o! l )ay(o;j)s 0 
r,iJ,y,S P 

In the last term we have used 


Y ^r(&i)ay(&j)60 


(&i ) ay Pi r /r\\ '"t/n\ i r tc\\ r \ /rw (^ ) 60 Pi 

6 f-Lo!y(0)u} p (0) + £ r iO r a (0)uJ s ' (0) 


6c • p 


oc • p 

£r 


£ r 

P \ 

ot'y 

r 


60 P 


For the u/(0 ) 

7ocJ r (0) = £ r u r (0) 

^ X^7o)<XTt4( 0 ) = £rw;(0) 


(5) 


( 6 ) 


(7a) 

(7b) 


is valid; hence, 

w rt (0) = uJ r (0)7o = £ r uj r (0 ) . 


( 8 ) 


Therefore (5) turns out to be 


* (jJ ■■ 1 ^ (_{) (jJ 

cosh — Y £ r u a (0)^(0) - s^h — cosh — 


E 

r,t,7,S 


(<3f( )ayPi r , A , — r . A , r ,n\ — r /r\\ (®i )i/3 Pi 

r — u>' (0)^(0) + u a (0)u> s (0) 


=B 


+ sinh 2 — Y £ r ^y(O)u r g(O)^-(&i)ay(oij)s 0 
r,‘J, 7,6 P 


(9) 


Because of the closure relation (6.41) one can sum over r in the first and the last 
sum to obtain 

A = cosh 2 (^) 6 a 0 - sinh (^j cosh (^j B 

+ sinh 2 (^) Y ^ L ( & i)ocy(&j)y0 

ij,7 P 


Exercise 6 . 6 . 


( 10 ) 
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Exercise 6.6. 


According to Exercise 6.1, (2) one has 


^2piPj(&i)ay(&j) 10 = 'Y^PiPj6i i 6ij) a p = p 2 6 a0 , 

'V, 7 ij 

and consequently (10) changes to 

A = cosh 2 (^j 6 a0 + sinh 2 6 a0 - B sinh cosh 
= <W ^2 cosh 2 - lj - B sinh cosh 

= ^ { 2 ^r -')~ B sinh (!) cosh (I) 

=iw, ;^“ Ssinh (i) cosh (!) - <>» 

To finish the proof, we need to show that B vanishes. With the help of (7a) and 
(9) this term reads 


*= E 
= E 

'•>',7,6 L 

Pi 


+ ^ (0)g ; (0 y WWl 


mx, 2 (7,w, g ^. o) + 

P P 


p [(“<7to)a/3 + (70<2, ) q/3 ] , 


( 12 ) 


where again we have used (6.41). But now it holds that d, 7 0 = — y 0 &i and conse- 
quently (12) vanishes. Hence we have in fact: 


Y^u r a (e r p)uj r J(£ r p) 


E 

m Q c 2 


fia/3 


6.3 Polarized Electrons in Relativistic Theory 

Electrons at rest are described by the spinors (6.3) and (6.13). These electrons are 
polarized in the z direction. For example, the spinor 

(°\ 

- 2 (°)= J 

\0/ 

describes an electron at rest with spin projection -h/2 on the z direction. If the 
rotation operator [(3.56), (3.57)] 

S R = e^-*/ 2 , 


(6.45) 
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where s is the unit vector along the rotation axis, acts on the states (6.3), one will 
obtain states which are - according to s -polarized in any arbitrary direction. A 
state (spinor) u, which is polarized in the direction s is an eigenstate of the spin 
operator S • s in this direction: 


S • suj 



or, with S ~ HS/2, 


£ • su — u 


(6.46) 


Our next task is to turn this equation into a covariant form. This means we are 
searching for the generalization of a (three-)spin vector into a (four-)spin vector. 
First we note: 


£ 


a 0 

0 <7 


contains the Pauli matrices & in the main diagonal; thus the form of the 4-spinor 
equation (6.46) will be similar to the 2-component Pauli theory. In the relativistic 
theory one commonly uses a different notation. The spinor of a (free) solution 
of the Dirac equation with positive energy, momentum p M and spin vector s M is 
denoted by 

u(p,s) . 

Hence u(p,s) satisfies the equation 

(rf - moc) a pup(p,s) = 0 . (6.47) 

Now we have to clarify the meaning of a four-spin vector s 11 . To do this we start 
with the polarization vector s in the rest system, which is a unit vector (s • s = 1), 
and write 


(OR.S. = (Orest sytem = (0, 8 ) . (6.48) 

In an arbitrary inertial system we can get the four-spinor by a Lorentz transfor- 
mation of Cs m )r.s. from the rest sytem: 

j^o'VOr.s. . (6.49) 

The are the coefficients of the Lorentz transformation from the moving system 
to the rest sytem. Hence it also holds, e.g. for the momentum, that 

= a M „(/>") r.s. , where (6.50) 

(Or.s. = ( m oC , 0,0,0) . (6.51) 

Because of the Lorentz invariance of the four-dimensional scalar product it now 
follows immediately that 

= Cv)r.s.( s/1, )r.s. = — s • s = — 1 (6.52) 


and 
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/ 0 \ 


= (p'‘)r.s.(^)r.s. = (m 0 c, 0,0,0) 


~Sr 


= 0 . 


(6.53) 


\-U 


In the rest system u satisfies (6.46), and hence 
£ • ■sm((p)r.s.,(^)r.s.) = u((p)r.s.,(s)r.s.) ■ 


(6.54) 


In the moving system we obtain u(p,s) from u ((p)r.s., O'Or.s.) by the Lorentz 
transformation S (a) for spinors. Let us consider a solution v(p, s) of the free Dirac 
equation for negative energy. Because of (6.33a) it satisfies 


(rf + moc)v(p,s) = 0 . (6.55) 

We require that the solution v(p,s) has the polarization -s in the rest sytem, i.e. 
in the rest system 

£-au((p) R .s.,(s) R .s.) = -v{(p)r.s.,(s)r.s.) (6.56) 

is valid. The minus sign on the rhs, which on first sight seems to be paradoxical, is 
remarkable. Later we will interpret this within the hole theory. The spinors u r (p), 
which arise from the rest spinors (6.13) obviously fulfill the demands (6.54) and 
(6.56), if we define s = e z and use the following relations: 

«(P)“z) =<Ap) , v(p,u z ) =u> 4 (p) , 

u(p,-u z ) = u 2 (p) , v(p, -u z ) = u 3 (p) . (6.57) 

There is that four-vector in the moving system which arises from the Lorentz 
transformation of the unit vector u z in the z direction, which is defined in the rest 
system by 

{u") RS =(0,u z ) = (0, 0,0,1) . (6.58) 

One may wonder why the spin projection (6.56) for electrons with negative energy 
is defined with the opposite sign. The reason for this becomes obvious later when 
we interpret an electron with negative energy, momentum -p and spin direction 
minus — i as a positron with positive energy, momentum +p and spin direc- 
tion plus = t • Then the relations given in (6.56) and (6.57) can be understood 
in the positron language! The spinor v ((p)r.s., (s)r.s.) describes a positron with 
the opposite spin to that which might first seem to be the case (in the “electron 
language”). We note that, according to this, an arbitrary spinor is characterized by 
its momentum p h , the sign of the energy and the polarization (s l ‘ ) R S in the rest 
system. [See the last part of Chap. 7 and also Chap. 12 (Hole Theory).] 
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In practical calculations of quantum electrodynamic (QED) processes we will be- 
come acquainted with a technique of calculation which allows the simple treatment 
of complicated expressions; especially the calculation of traces of products of many 
7 matrices. It is based on a projection procedure, i.e. a method to project a spinor 
with a given sign of energy and fixed polarization out of a general wave function or 
a wave packet. The appropriate operators which achieve this are called projection 
operators. In the nonrelativistic case, say, 


P± 


1 ± 
2 


(7.1) 


is a projection operator for spin up (+) or down (— ). Acting on an arbitrary state 
P± just takes out the corresponding parts. Now we want to generalize this concept 
to the relativistic case and to search for four operators which project those four 
independent parts belonging to positive or negative energy with spin up or spin 
down, and with the same momentum p, out of an arbitrary free solution of the 
Dirac equation (i.e. out of a plane wave with momentum p). The four projection 
operators are denoted by P r (p). They should, of course, be in covariant form so 
that they can be given in any Lorentz system (by transformation) in an easy way. 
We denote those four projectors explicitly as 

K(p) = P(Pn,u z ,e r ) . 


They should obey the conditions 

P r (p)u r '(p) = 6 rr >u r '(p ) and (7.2) 

Pr(p)P r >(p) = Kr'Pr(p) • (7.3) 

Both equations show the projection properties of P r (p ) very clearly. If we now 
recall (6.33a), i.e. 


(jj - e r m 0 c)uj r (p ) = 0 = (e r rf - m 0 c V(p) 


(7.4) 


we at once find the projection operator for eigenstates with positive or negative 
energy : 


Mp) — 


e r jj + m 0 c 
2moC 


(7.5) 


Indeed, with the help of (7.4) we find 
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A r (p)u r (p) 


e r jf + m 0 c 
2moc 


u r (p) 


2moc 

2moC 


u r (p) = u /(p) 


and furthermore because of 


(7.6) 


= InlvP^P'' = ^(ifily + lvl [ i)p IX p V = g, L vP ,l p 1 ' = p 2 = -J ~ p 2 


then 


A r (p)A r '(p) = 


m y + C V _2 _ m 2 r 2 

2 — p — m 0 c 


(e r tf + m 0 c)(£ r 'rf + mpc) 
4 wq c 2 


ZrSr'lftf + fflpC 2 + (e r + e r -)mcjl 

4 niQC 2 

mpC 2 (l + £ r £ r ') + mctfiEr + £ r >) 
Arrive 2 

mlc 2 ( 1 + e r e r ') + m 0 c^£ r ( 1 + E r e r <) 


Am^c 2 

(1 + £ r £r ') (me + £rrf) (1 + £ r £r') t , , 

= —2-”0'- ^ c 2 = —^—A(p) 


(7.7) 


Obviously all the operators (7.5) decompose into only two types, namely those 
with £ r = +1 (which we call A + ) and the ones with £ r = —1 (which we call yl_): 


A±(p) = 


±jf + me 
2me 


(7.8) 


Notice that jf is written here, and not ff. Hence the momentum (with real numbers 
as components) and not the momentum operator occurs in the projector (7.8). We 
may present the relations (7.7) more transparently by writing explicitly 


(a) 

(A + f 

= A + 

(b) 

(A .) 2 

= A- 

(c) 

A+A - 

= 0 

(d) 

A+ + A_ 

= l 


(7.9) 


These are the typical properties an energy projection operator must have. Besides, 
the expression (7.5) is also covariant, so that all required conditions are fulfilled. 

Now we consider the spin— projection operator. Again we consider the rest 
frame, where the spin can be easily described. We already know from (7.1) that 
in this case the projection operator for “spin up” or “spin down” is given in the 
nonrelativistic limit by 


P± = 


1 ±»3 
2 


If we define the spin-projection operator with respect to an arbitrary axis given by 
the unit vector u (u • u = 1) in a nonrelativistic theory, then (7.1) is generalized 
to 



7. Projection Operators for Energy and Spin 


179 


P(u) = 


1 + <x • u 
2 


(7.10) 


It is obvious that with u z = (0, 0, +1), the special case (7.1) is contained within 
the general expression (7.10). But this is still nonrelativistic. In the Dirac theory 
we need the relativistic covariant generalization of the operators (7.1) or (7.10). 
For that purpose we make use of the four-component vector 


which is given in the rest system of the electron by 

«) RS = (0,0,0, l) = (0,u z ) 


(7.11) 


(7.12) 


[cf. (6.58)]. Thus, by Lorentz transformation into arbitrary inertial systems it fol- 
lows that 


<=o , v(“r)*.s. ■ < 7I3 > 

With (7.12) we can write the spin-projection operator in the rest system, which is 
now extended to the fourth dimension and labelled by 27(w z ). This generalization is 
achieved by first denoting 27(w 3 ) where u\ is the third component of u z of (7.12): 


£ = 1 + ^3 = 1 + 7573 (« 3 ) RS . 7o = 1+7sQ*z) r . s .70 

' z) 2 2 2 


(7.14) 


This is because (^)r.s. ~ ( u z 7^)r.s. = (w 3 )r.s .73 and therefore 


7573 (« 3 ) R . S . 


7o = 757370 = i7°7 1 7 2 7 3 737o 

— ;J^1J ..3 . .3 o _ 

— 7 7 7 7 7 = +17 777 


= +i7 1 7 2 


0 <j\ \ / 0 02 

—b\ Oyl —a 2 0 


= 1 


-<71(72 

0 


0 

-b\U2 


<73 0 

0 <73 


= £3 


(7.15) 


and 1 bjbj = i + 6 ir Now the question arises how to generalize (7.14) to a 
co variant form. The factor 70 is disturbing, while 75 i/i z is a covariant expression. 
In the rest frame the effect of 70 on the rest spinors u/( 0) is given solely by the 
factor ±1. For the spinors u> 1,2 (0) we can therefore easily omit 70 in (7.14). If we 
do the same for the spinors w 3,4 (0), the effect in the rest frame is 


S (t/|) w 3,4 (0) = 


1 + 75 (fi) 


R.S. , ,3,4 


w J>4 (0) = 




w 3 - 4 (0) 


= - — 75 (jpR-s. 7 y ,4 (0) 

= f 0 • w 3 (0) 

\ 1 • w 4 (0) ' 


1-^3 

2 


w 3 ’ 4 (0) 


(7.16) 


1 See W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Hei- 

delberg 1994), Exercise 13.2. 
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This would be exactly opposite to what we may naively expect. Analogously, one 
easily verifies that 


Now we see that the covariant spin-projection operator 

£(«z) = 


(7.17) 


(7.18) 


satisfies the following relations in the rest system [denoting E ((u z ) r.s.) = £’(m z )( 0)]: 
£(« z )(0)w , (0) = u; 1 (0) , r(- Mz )(0)cA0) = 0 , 

£(« z )(0)u> 2 (0) = 0 , E(-u z )(0)oj 2 (0) = cj 2 (0) , 

E(u z )( 0)u, 3 (0) = 0 , r(-« z )(0)u, 3 (0) = cc 3 (0) , 

£(w z )(0)u> 4 (0) = oj\0) , r(-« z )(0)o; 4 (0) = 0 , (7.19a) 

which we can rewrite with the help of the definitions (6.57): 

E(u z )(0)u(0,u z ) = u(0,u z ) , E(—u z )(0)u(0, u z ) = 0 , 


E(u z )(0)u(0, -u z ) = 0 , 

r(« z )(0)u(0, w z ) — 0 , 
r(w z )(0M0,u z ) = v(o ,u z ) 


E(-u z )(0)u(0, -u z ) = m(0, -u z ) 
E(-u z )(0)v(0, -u z ) = v(0, -u z ) 
E(-u z )(0)v(0,u z ) = 0 . 


(7.19b) 


Notice, the operator E{u z ) should not be confused with the operator E\ Because 
(7.18) is covariant, these relations are also valid in a moving system in which the 
spinors are given by u(p, u z ) . . .: 

E(u z )u(p,u z ) =u(p,u z ) , 

E(u z )v(p,u z ) =v(p,u z ) , 

E(-u z )u(p,u z ) = E(-u z )v(p, w z ) — 0 . (7.20) 


The covariant generalization (7.18) of the nonrelativistic spin-projection operator 
(7.14) corresponds naturally to the convention, given by (6.56) and (6.57). Therein 
the spin projections of negative-energy states are contragredient (i.e. opposite) to 
those of normal “states” of positive energy [see (6.54) and (6.56)]. The positron 
interpretation of the hole theory, which we have already preliminary considered 
in (6.57), comes up here in a natural way through the covariant spin-projection 
operator. Furthermore, we generalize the spin-projection operator (7.18) for an 
arbitrary spin vector with = 0 [see (6.52) and (5.53)]: 

£ (j) = l+jsl ( 7 . 21) 

Because of the covariance, it follows that as a generalization of (7.20) the u(p,s ) 
and v(p,s) obey the relations [see (6.48)-(6.57)] 

E(s)u(p,s ) =u(p,s) , 

E(s)v(p,s ) =v(p,s) , 

E{—s)u(p,s) = E(—s)v(p,s ) = 0 


(7.22) 
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7.1 Simultaneous Projections of Energy and Spin 

With the projection operators A±(p) for the energy and E(s) for the spin we can 
easily construct projectors for energy and spin. As we already know, the motion 
of a free particle is completely determined by 

the four momentum p fl , 

the sign of the energy e , 
and the polarization s M , 

with .s >L p fl - 0. Therefore the energy-spirt projectors 

Pi(p) = A + (p)ZJ(u z ) , Piip) = A + (p)Z(-u z ) , 

hip) = A-(p)E{-u z ) , P 4 (p) = A-{p)E(u z ) , (7.23) 

which are composed of (7.8) and (7.18), determine the free Dirac waves, these 
being projected out of superposed wave functions (wave packets). The order of the 
energy and spin projection in (7.23) does not matter because 

[Z(s),A ± (p)}_ = 0 , (7.24) 

as s tl p fl = 0 is valid. This is easy to verify, because the commutator (7.24) is 
equivalent to the commutator 

[i/,7 5 y]_=0 , (7.25) 

which we immediately prove: 

= 7 M p At i7°7 1 7 2 7 3 7 J/ ^ = -i7°7 1 7 2 7 3 7 A ‘7 %/V 
= -75(2 - 7"7 = 75 jrf ■ 

Here g^s^p^ = s^p^ = s -p = 0 according to (6.53). Because of (7.2), (7.6) and 
(7.20) or (7.19a), we easily confirm that the defining equations (7.2), i.e. 

P r (p)u r '(p) = 6 rr ,u r '(p) (7.26) 

are fulfilled! 

In the following 2 we will appreciate the usefulness of these projection operators, 
which at first seem to be a little artificial. With their help we will often perform 
practical calculations, without explicit use of 7 matrices and free spinors. Indeed, 
we will see that an explicit calculation, component by component, can be replaced 
by a rather elegant one through the use of the projectors. 

According to (6.31) the general free solution was given by 

tP r (x) = u r (p)e- i£rP ^ /h . 

For r = 1,2 these are free waves of positive energy (e r po = Po) and momentum 
Pii> l,2 (x) — Pi^ ll2 (x). For r — 3,4 these waves have negative energy (e r po = —po) 
and the eigenvalues of the momentum operators are 

2 See in particular W. Greiner, J. Reinhardt: Quantum Electrodynamics , 2nd ed. (Springer, 

Berlin, Heidelberg 1994). 
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Pii> 3,4 (x ) = £3,4 Pi1p 3,4 (x) = -Pi\l?’ 4 (x) , 

that is, they are negative (-/>,). We call f 3 ' 4 (x) “spinors with negative energy”. 
They describe antiparticles of positive energy and positive momentum p,. This 
striking redefinition of plane waves with negative energy has already occured for 
the spin [see discussion following (6.57)]. To this end the eigenvalues of the mo- 
mentum and spin operators agree with the momentum or spin direction of the 
waves of positive energy. For waves with negative energy we define it the other 
way round. In other words: a particle with negative energy and momentum eigen- 
value -pi has momentum +p,-, and a particle with spin eigenvalue -u z has the spin 
+« z . This seemingly strange definition has a deeper reason: in the framework of the 
hole theory, which will follow soon, we interpret an electron with negative energy, 
momentum — p, and spin -u z as a positron (i.e. the antiparticle of the electron ) 
with positive energy, momentum I p, and spin u, . This occurs as a natural result. 
Thus we hold on to our definition. Accordingly ^> 3,4 (x) is an electron wave func- 
tion with negative energy, negative momentum and negative spin projection [see 
(7.19a)] or - and this is the physically correct description - a positron with positive 
energy, positive momentum and positive spin projection. For the later property the 
redefinition (6.57) is necessary, which means the change of r = 3,4^r = 4,3. 



8. Wave Packets of Plane Dirac Waves 


Already in Sect. 2.3 we dealt with the single particle aspects of Dirac waves. 
In order to gain a deeper understanding and a possible interpretation of the free 
solutions, we study wave packets. These are superpositions of plane waves which 
yield localized wave functions in space time. Since the Dirac equation is a linear 
wave equation, the wave packets are also solutions of the free Dirac equation, 
which is just the superposition principle. A wave packet of plane waves with 
positive energy has the form 

^ (+) (cc,0= / ' (8J) 

J v2i xh v L ± s 


The amplitudes b(p,s ) determine the admixture of the plane waves u(p,s ) • 
Q -'Pf x '‘ / h to the wave packet. The “(+)” indicates that a superposition of only 
positive-energy plane waves is taken. Normalizing to unity implies 


J ' l p {+) Hx,t)ip {+) (x,t)d' } x = 1 = J d?p j 

^^\J^^bHp,s)b(p\s')u ] (jj,s) u (p',s , )e~' (pn ~ p « )x<)/h 

p i (p-p f )-x/h 

X / , d 3 Jt 


/ 


yj2'Kh 3 \/2'Kh 3 

"■ ’V*" 

Sip-p') 


d^iiE/moc 1 ) because of (6.35) 


= / ^ r ~~b ] (p, s)b(p, s') (p , s)u(p , s') 

= J d 3 pY^b ] (p,s)b(p,s) 

= = 1 • 


(8.2) 


The factor 


exp 


:(Po - Po)x°j = f~^( E ~ E '} X ~ 


is equal to 1 because of 
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E — cy/p 2 + moc 2 — cy/p' 2 + niQC 2 = E' 

utilizing the property of 6(p - p'). The next step is to find the current of such a 
packet, given the velocity operator ca; thus we obtain 

J (+) — y 'i/> (+ ^ca'0 (+) d 3 x . (8.3) 

In the calculation of this expression (shown in Examples 8.1 and 8.2), we will 
learn a new and elegant mathematical technique, i.e. the Gordon decomposition. 
The result is 

j;*> = I a £ |i0>, s )| 2 = (^jr) + = «%)<} + ■ (8-4) 

Because of the normalization (8.2), in the second step we have written y. <+) 
as an expectation value with respect to the wave packet of positive energy and 
labelled this ( ) + . In this way the mean current of an arbitrary wave packet of 
plane waves of positive energy is equal to the expectation value of the classical 
group velocity v gT = c 2 p/E. This corresponds to the Ehrenfest theorem of the 
Schrodinger theory and agrees with our earlier considerations (2.62) and (2.67), 
where we restricted ourselves to the even part of the velocity operator to get the 
same result. Obviously the restriction to the even part of an operator is equivalent 
to the restriction to those wave packets which are constructed from one sort of 
solution (of positive or negative energy) only. 


EXERCISE 


8.1 The Gordon Decomposition 

V-’i and r tp 2 are considered to be two arbitrary solutions of the free Dirac equation, 
that is 


(fl-moc)'ip i=0 , 

and 

(p-moc)'ip 2 = 0 , 

The second set of equations holds 
Then 


^i( - P - m 0 c) = 0 , 

( 1 ) 

^ 2 ( ~ P ~ m 0 c) = 0 . 
for the adjoint spinors, as will be proven below. 


^ 2 7*Vi = ^ - ( P *V 2 ) (^ 2 <x*“>i) 


is valid. We prove this with the help of the relation 

4V = 7 ^ 7 "^ = a^bvl^i' 


= a„b v \ - (7*y + YY) + 2 (7 M 7" - 7 1/ 7 m ) 


2 

= a ^ - iaWcr^ , 




( 2 ) 


( 3 ) 
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which is valid for two arbitrary four- vectors a ^ and b^. Furthermore, we introduce 
the definition 


tp =. (p^t )Y , ( 4 ) 

der 

which means that the momentum operator acts to the left onto the function ip(x). 
Because of (1) we obtain 

0 = ti{ ~ P ~ ™oc)4t\ + ti4{$ - m oc)ti ■ ( 5 ) 

This happens because the terms on the rhs are zero [according to (1)]. In particular 
the first term on the rhs can be rewritten as 


(-Pfi'tp 27 ^ ~ mocip 2 ) 4ti • 

That it vanishes is obvious from the fourth equation (1). Let us deduce this from the 
second equation (1). It follows by multiplication with 70 and Hermitian conjugation, 
that 

(7°(# - me )^ 2 ) f = 0 = 4>l ( 7 ^ p l - m 0 c) 7 0t 



= +pj l V , 27 /lt 7 0t “ mociPl^ 



= -Puth^l^ ~ moctl'l/ 01 ■ 

(6) 

Because of 7't = 

—7' , 7°^ = 7 0 , we can write 


yy^ty) _ y 

or 7°7 l/ ^ = 7^7° or 7*^7° — 7°7 l/ 

(7) 


and for (6) we achieve 

-put 27V* - mocth 0 

= 0 = -p^tiY - mct>2 = ti{ - P - m 0 c ) 


( 8 ) 


Thus we have proven (5), from which we can furthermore conclude that 

0 = -2m 0 cip 2 4t\ + ti[4P - P4]t i 

= —2moc / ip 2 <jhp\ + t 2 [a^ p M - ia^ p - p + i p M a"v] 

= -2^00^27^^! +ti[P' 1 - P^ ~'^p l 't ll v -ip u a tM v ]i>^ ■ (9) 

Because a M was arbitrary the coefficient of a ^ must vanish, that is 
cti^t i = 1 - {p^ti) t>\} - ~^P V i) • 

This is just the statement of (2). This splitting of the matrix * s called 

Gordon decomposition. 1 Its physical meaning is that the Dirac current density 
7^1 can be split up into a convection current density 

1 Surprisingly, this procedure was derived by Walter Gordon in Z. Phys. 50, 630 (1928). 


Exercise 8.1. 
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Exercise 8.1. 


— fitful - </>l] 


( 10 ) 


and spin-current density 

2^" 


( 11 ) 


EXERCISE 

8.2 Calculation of the Expectation Value of a Velocity Operator 


Problem. Calculate the expectation value of the velocity operator of a wave packet 
consisting of plane waves with positive energy, using the Gordon decomposition. 

Solution. A wave packet of Dirac plane waves of positive energy is given by 

i> w (x,t) = J (2^/2 ^-E^ s)u{p,s)e p » x "l h . ( 1 ) 

This gives the expectation value of the velocity operator (do not confuse the given 
symbol for wave packets of positive energy (+) with that for Hermitian conjugation 

t): 


y'(+) = J d 3 x 

= J 'f c^f’ ^ x = c J d 3 x 


-HI 


d 3 p d 3 p f 


(2irh) 3 / 2 (2'Kh) 3 / 2 V E 


m 0 c 2 Im^c 2 


b*(p',s')b(p,s)e (1 




u(p ,s )cy l u(p,s) . 


( 2 ) 


±s .is 7 


Via the Gordon decomposition [Example 8.1, (2)] of the spinor matrix element this 
becomes 


J ‘ m = II 6,pd3p '^ £ b-(p',s')b(p,s) 


i 


±s,±s f 

Q ii/h){p^-p^)x^ j 


x [(p" + P‘) ~ i {~P ,V +P U ) &‘u] u(p,s) . 
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Because of the identity 


/ 


lt _ , o e -(i /h)(p"-p')Xi 

*(i/h)(p Q -p 0 )x u 


d 3 x 


\JC1t\K) \/(2nh) 


= 6(p n -p ‘)&\ p 


i (E' 


k \ u 

h \ c c 


=1 


this evaluates to be 

b*(p,s')b(p,s) u{p,s')u(p,s) 

S V ' 

—6 ss i [from (6.34)] 

= j ^p^-^\ b (p^)\ 2 . 

As we can see, the spin-dependent part of the current vanishes. 


r<(+) 


■/ 


d > 


moc 2 2p‘ 

E IniQ 


E 

±s,±s 


( 3 ) 


Exercise 8.2. 


Let us resume our earlier discussion [see (2.61) to (2.70)] on the velocity 
operator for one particle in relativistic theory: In Schrodinger’s theory the velocity 
operator v = p/m was proportional to momentum, but this is no longer the case in 
Dirac’s relativistic theory. The velocity operator for free particles ca is no longer 
a temporal constant, because of 


dca 

HT 




(8.5) 


From (8.4) we can even conclude that one needs the solutions of negative energy to 
construct eigenfunctions of ca. Further, wave packets consisting of plane waves 
with only positive energy have the expectation value of the velocity |(cd')| ~ 
\(c 2 pi/E)\ < c, whereas the eigenvalues of ca 1 are exactly ±c. This motivates us 
to consider wave packets made up of the complete set of plane Dirac waves, i.e. 
plane waves with both positive and negative energy. Instead of (8.1) we now write 


ip(x,t) 


r d 3 p I mpc 2 

J hf * E 

x \b(P’ s ) u (P’ s ) e ' PflX>1 / h + d*(p,s)v(p,s)e. +ip,lX ' 

±S 


( 8 . 6 ) 


The coefficient b(p,s) are the probability amplitudes for waves with positive en- 
ergy, whereas d*(p,s ) are those for negative energy. The probability of finding a 
particle anywhere must be one (see Exercise 8.3): 

d 3 x ^(x,t)ip(x,t) = j d 3 p^[\b(p,s)\ 2 + \d(p,s)\ 2 ] = 1 

J ±s 


(8.7) 
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From this we calculate the current of the wave packet (see Exercise 8.4): 

J k = J d 3 x t)ca k %p(x, t) 

= / d3 ^ j E {\ b (P’ s )\ 2 + l rf 0M)l 2 ) 

+ i c Y, b*(-p,s')d*(p,s)e 2ix oPo/^u(-p,s , )a k(> v(p,s ) 

±S,±S f 


-ic ^ d(—p,s')b(p,s)e 2vc ° Po/h v(-p,s')a k0 u(p,s) 

±S,±S f 


( 8 . 8 ) 


Here u{—p,s ) means m ( \/p 2 + nioc 2 , — p,s ). This abbreviation is not particularly 
obvious, but it is absolutely clear what is meant. In the following we use this 
notation for the coefficients b, d* and for v(-p,s ) too. 

If we compare (8.8) with (8.4) we see that additional terms appear in (8.8). 
The first term in (8.8) represents the time-independent group velocity that also 
appears in (8.4). The second and third terms are interferences of solutions with 
positive and negative energy, which oscillate time-dependently because of the fac- 
tors exp( ±2i/7()A'() /7)). The frequency of this Zitterbewegung 2 is 


2poc 2n%c 2 

h > h 


2 x 10 21 s“‘ 


(8.9) 


and its strength is proportional to the amplitudes d(p, s ) of the waves with negative 
energy in the wave packet. See Exercise 8.5! 


EXERCISE 


8.3 Calculation of the Norm of a Wave Packet 


Problem. Calculate the norm of the general wave packet built out of plane Dirac 
waves of positive and negative energies 


x X] \b(p,s)u(p,s)e~ ip f lX>i/h + d*(p,s)v(p,s)e +ip P x> * /h 


Solution. The current is defined as 

jp, = c J d 3 x^(x,t)^^(x,t) , ( 1 ) 

and the wave packet reads (see above) 


2 This name stems from E. Schrodinger: Sitzungsber. PreuB. Akad. Wiss., Phys.-Math. 24, 
418 (1930) and in German means literally “trembling motion”. 
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-J 


d 3 p 


(2irh) 3 / 2 V E 


m 0 c z 


[b(p,s)u(p,s)e~ ipx / h + d*(p , s)v(p,s)c +ip - x / h 

±S 


( 2 ) 


The zero component of the current (probability) is then evaluated as 


W = -J 


c 


0 


J d 3 x ip\x,t)ip(x,t) 



d 3 x d 3 p d 3 p f niQC 2 
(2nh) 3 y/EE 1 


±tf ,S ' 


+ b*(p / ,s')d*(p,s)u ji (p l ,s l )v(p,s)e. ,(p+p ) ' x/h 
+ s>(p, s)u V, s')«(p, s) e - i 0’+P , >-*/» 

+ d(p', s’)d*(p, s)v*(p', s')v(p , s)e l(/ ' _p,) ' :c/ft l 


( 3 ) 


Integrating over (fix /{Intif leads to 5 functions 6 (3) (p' ±p). Because pa = E — 
c 4 4- p 2 c 2 , the energies po = p^E — E f ) are identical if the momenta |p| and 
|p' | are the same. Then we can also integrate over d 3 p f and obtain 

/ 2 
d 3 p^- Y 

±s,s' 

+ d(p,s')d*(p,s)v i (p,s')v(p,s) 

+ b*(-p,s')d*(p, s)u\-p,s')v(p, s ) e 2ipa ^ n 

+ d(-p,s , )b(p,s)vH-p,s>(p,s)e- 2ipaXo/h } . (4) 

From (6.35) we have 


u*(p,s f )u(p,s) = v*(p,s')v(p,s) = ~6 SS > 

itiqc 1 


( 5 ) 


u\-p,s')v(p,s) = v\-p,s')u(p,s) = 0 , (6) 

and therefore 

W " J \ b (P,s )\ 2 + \d(p,s)\ 2 } = l • ( 7 ) 


Exercise 8.3. 


In the very last step we were careful to normalize the total probability W to 1. 
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exercise 

8.4 Calculation of the Current for a Wave Packet 


Problem. Calculate the current 


J k = 


= c J ip\x,t)a k ijj(x,t)d } x — c J ^(x,t)j k ilj(x,t)d 3 x 

for the general wave packet of Exercise 8.3. 


Solution. The space components are calculated in exactly the same manner as in 
the last Exercise (8.3): 

J k = c J d 3 x ip(x, t)j k ip(x,t) 

= J d 3 x {#V - (p k *l>)i> ~ ipA'tpv kl/ i’)} , (1) 

where we used the Gordon decomposition (Example 8.1). Inserting (2) of Exercise 
8.3 yields 

jk = J_ f f f d 3 xd 3 pdV m 0 c 2 

~ 2 m 0 JJJ (Inti ) 3 y/EE' 

X { b *(p , y s ')b(P,s)u(p , ,S , )u(p,s)p k 

,s' 

- b*(p', s')d*(p, s)u(p', s')v(p,s)p k e i(j ’ + P" > - x / h 
+ dip', s')b(p, s)v(p', s')u{p,s)p k t~' ip+p) ' x ^ h 
-d(p',s , )d*(p,s)v(p , ,s')v(p,s)p k e- i( P-P l) -^ h 
+ b*(p' ,s')bip,s)u(p' , s')u(p , s)p' k e~ l(p ~ p )' x / h 
+ b*ip',s')d*ip,s)uip',s')v(p,s)p' k e +iip+p,) - x / h 

- d{p' , s')bip, s)v(p', s')uip,s)p' k e~' (p+p) ' x ^ h 

- d(p', s')d*ip, s)vip' , s')vip, s)p' k t' (p ~ p ) ' x / h 

- i b*ip',s')bip,s)uip',s')a k,/ uip,s)ip u - pl)e~ l(p ~ p,) ' x/h 
+ i b*ip', s')d*ip, s)u(p',s')a ku vip, s)ip v + p' u ) e i(p+p ' ) ' x l h 

- \dip’ ,s')bip,s)vip' ,s')d kl/ uip,s)ip u + p ' i/ ) e - i< ~P+ p 'y x / h 

+ i rf(p^JV(p,J)w(p^JO^u(p,J)(p v -^)e i ^-P , >■ x / ft } . (2) 


Integrating again over d 3 x/(27r/i) 3 we get 6 functions S 0) (p' ±p). The integration 
over d 3 p' then yields 
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vit 


- /« 

'■mo J 


3 w 0 c 
P- 


2 m Q J E 

yZ ( 2 P k [b*(p,s')b(p,s)u(p,s l )u(p,s) 


±s,s' 


- d(p,s')d*(p,s)v(p,s')v(p,s)] 

+ 2p k [d(-p,s > )b(p,s)v(-p,s , )u(p,s)e~ 2 ' PaXo/h 
-b*(-p,s , )d*(p,sM-p,s')v(p,s)e 2i P^/ h } 

2\f 

+ \b*(-p,s')d*(p,s)u(-p,s')a k0 v(p,s)e 2i P^ h 

c L 

- d{-p, s ')b(p,s)v{-p,s')d k0 u(p, s) e- 2i ^o/^] } . 


From the orthogonality relation (6.34) we get 

u(p , s')u(p , s) = = -v(p,s')v(p,s) , 

u(-p,s')v(p,s) = 0 = v(-p,s')u(p,s) . 


Therefore, from (3) only 

•/* = J dV j ^ [\b(p,s)\ 2 + \d(p,s)\ 2 ] 


k 2 

p K c L 


+ ic [b*(-p,s , )d*(p,s)u(-p,s , )a k0 v(p,s)e 2i P° Xt)/h 

discs' . 

- d(-p,s , )b(p,s)v(-p,s , )a k0 u(p,s)e- 2i P^ h ] i 


j 


remains. This is the earlier used result (8.8). 


( 3 ) 


( 4 ) 


( 5 ) 


Exercise 8.4. 


exercise 

8.5 Temporal Development of a Wave Packet 
with Gaussian Density Distribution 


Problem. At time t = 0 the following wave packet with Gaussian density distri- 
bution is defined as 


4>\x,0,s) 


1 


(nd 2 ) 3 / 4 


r'^/^cAo) 


(i) 


Determine the wave packet at time t developing from (1). Consider the intensity of 
the negative energy solutions in the wave packet. What does one learn in general 
about the applicability of the one-particle interpretation of the Dirac equation? 
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Exercise 8.5. 


Solution. Equation (1) is a boundary condition for the general wave packet 

s'=±s 

+ d*(p,s')v(p,s')e m - p - x) / h ] . (2) 

The requirement 

ip(x,0) = ip'(x,0,s) (3) 


leads to equations that determine the coefficients b(p, s') and d*(p,s'). In particular, 
it follows from 


/ 


d 3 p 

( 2irh ) 3 / 2 


me 2 
E ^ 

s' = ±s 


d*(p,s , )v(p,s')e- ipx / h 



d 3 p 

(Ink) 3 / 2 



( 4 ) 


(where we have inserted p = —p' and p 0 = -p' Q ) that 


/ 


m§c L 


Y [ b (P’ s ') u (Pi s, )+ d *(p l ’ sl ) v (p , ’ s ')] 


s' = ±s 


x e 


\p-x/h_ 


d 3 P 


i 


(27T h) 3 / 2 (ird 2 ) 3 / 4 


e-WWu'iO) 


( 5 ) 


The rhs of (5) is just the Fourier transform of the braced expression on the left. 
Therefore the inverse transformation reads: 



Y [b(p,s')u(p,s') + d*(p' ,s')v(p ' ,/)] 

S* = ±S 


f d x -ip-x/h 

J (2nh) 3 / 2 


1 


(nd 2 ) 3 / 4 


-b| 2 /2rf 2 w l (0) 


( 6 ) 


On the rhs of (6) we have the Fourier transform of a Gaussian distribution. With 
the identity 


f ^ x c -ip-x/h 1 -\x\ 2 /2 d 1 

J (2irh) 3 / 2 ( 7 xd 2 ) 3 / 4 


1 

7 r 3 / 4 


d 

h 


3/2 

e-bl^/2 «^ g(lpl) , 


( 7 ) 


we rewrite (6) as 



Y [b(p,s')u(p,s') + d*(p',s')v(p' ,s')] = ^(IpDw^O) 

s' = ±s 


( 8 ) 


Expanding the sum and using the definitions 
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u(p,s' = +s) = u l (p) , u(p,-s) = u 2 (p) , 

v(p', +s) = u> 4 (p ') , v(p', - s ) = U) 3 (p') , 

we get 


(9) 


moc 


E [b(p, s)u x (p) + b(p, - s)co 2 (p ) + d*(p', s)u 4 (p') + d*(p', -s)w 3 (p')] 

= 5(|p|)w , (0) ■ (10) 

Now we multiply (10), in turn, by [^‘(p)]!, [w 2 (p)]l, [w 3 (-p)]l and [w 4 (-p)]t. 
With [see (6.30)] 


[-> , <p>] t = 


[w 2 (p)] 


I E + moc 2 
2moC z 

I E + moc 2 


1 , 0 , 


1 , 0 , 


PzC 


P-C \ 
E + moc 2 ’ E + moc 2 ) 


■PzC \ 


P + C 


y 2 m 0 c 2 

[ w 3 ( _ p) jt = jE + moc 2 ( -p z c -p.c 


E + m 0 c 2 ’ E + moc 2 ) 

,1,0 


2 ra 0 c 2 \E + moc 2 ’ E + wqC 2 


r, , 4 / ot _ E + moC 2 ( -p+c p z c 

^ V 2m 0 c 2 Y£+m 0 c 2 ’£ + moc 2 ’ ’ 


(ID 


the following relations are valid: 

[c*/(p)]t a/(— p) = [«/'(— p)]^ o/(p) = 0 for i = 1,2 and 7 =3,4 ; 


[u l (j>)] ] u'{p) = 

[u' (~p)] ] (J (-p) = 


moc 

E 


2 6'j 


moc 


2 


for i,j = 1,2 
for ij = 3,4 


Hence, we get the four equations: 
m 0 c 2 . , E 


b(p,s ) j = <K|p|) [u^CpJ^tAO) 

tL MqC 


b(p,s) = \j E + 2 '” 0< g(\p \ ) • 1 , 


f ^Y~ b( ^’~ s>> ~c 2 = 9 (\P\) [^ 2 (P)] + ^‘(0) 

=> b(p , -j) = 0 , 

=5(|p|)[w 3 (~P)] t w 1 (0) 

( ~Pz)c 
E + m 0 c 2 


moc 1 


d*(p', ~ s ) = \j E + 2E ° C g(\p\) 


me , E 

—=~d (p ,s ) =• 

E moc 1 


g(\p \ ) [lA-p^tAo) 


, E+m 0 c 2 (-p+)c 
d (P > 5 ) = V TS S(IpDi 


2E 


E 4 - m 0 c 2 


( 12 ) 


Exercise 8.5. 
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We now insert these equations, as well as (9) and (4), into (2), which yields: 

\ 


X 


1 ^ 


0 


PzC 

e -iE,/h + 

E + moc 2 


P+c 


\ E + m 0 c 2 / ) 


V 


(E + m 0 c 2 ) 2 
PzP+c 2 
(E + moc 2 ) 2 
~PzC 

E + m 0 c 2 . 

0 / 


x e' Et / h 


+ 


f d 3 p 

J (Inh) 2 ' 


(Px +Py)c 2 \ 

~ 

(E + m 0 c 2 ) 2 


-PzP+c 2 

e i Et/h 

(E + m 0 c 2 ) 2 

0 

- P+ c 

' E + ttiqc 2 / 


i> 

( E + m 0C 2 r - iEt /h , P 2c2 
' 2E 6 + 


x 


2 E(E + m 0 c 2 ) 

rv 


gift / h \ 


0 

(g-iEt/h _ gi Et/h^ 


PzC 

I 2E ' 

y P + c ^Q-iEt/h _ gi Et/h^ 

=/< 

( E + m 0 c2 p—'iEt/h , E - m 0 c2 _i Et/h \ 

2E + 2E 6 j 

0 


J 


X 


v 


-§£ 2 isinf 
2£ h 

P + c . . Et 
■ — — ism 
£ a 


= / 0 ^ 75 e ,p ;E/fi 9(|p|) [ (cos H sin w '(0) 

¥Kfy< o) -'^KtH ■ 


/ 


(13) 


At an arbitrary time t 0, ip(x,t) is thus composed of the following three parts: 
ip(x, t) = ci(®, OaAO) + C 3 (®, t)a7 3 (0) + c${x, t)w 4 (0) . 


The functions ci(a:), C 3 (a;), C 4 (a:) can be calculated only by use of numerical 
methods: 
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-I 


d P e i p-x/h_ 


1 (d 


(2nh) 3 / 2 7T 3 / 4 \ h 


3/2 


■\p\ 2 d 2 /2h 2 


X cos 




WTqC 4 + p 2 c 2 


h 


-t — 1 


c 3 (x,t) 


-/ 


d 3 /> 


(2nhy/ 2 

PzC 


Jp ■ x/h 1 I “ ^ p -|p| 2 d 2 /2ft 2 


Y^WqC 4 + p 2 c 2 


)Jm*c , +p 
/rf\ 3/2 . 

7T 3 / 4 \ k J 

ml c 4 + p 2 c 2 

sm 1 , 

a 


m oC - Y '"o 6 * 4 + ^ 2<:2 

s j n _v ^ 


,2 C 2 


ca(x t) •= / d3p e'P-g/ft . 1 / e _lp|2</2/2/i: 

4( , J - J (2nh) 2 / 2 7t 3 / 4 U; 


Pv - IP* 

X •=.:./ rrz — r Sin 

w ml c 4 + p 2 c 2 


J mlc 4 + p 2 c 2 

1 t-l * i 


h 


(14) 


This result can be understood in the following way: To obtain the wave-packet 
of (1) a superposition of plane waves of positive as well as of negative energy is 
necessary. Here the the coefficients occur in such a way that for t — 0 the u; 3 (0) 
and cj 4 (0) parts of the partial waves of positive energies and those of negative 
energies cancel each other. This can be clearly seen for t = 0 in (13): The third 
and fourth components of the sum of the three spinors vanish. Since the partial 
waves of positive and negative energy behave differently in time [exp(— i Et/h) and 
exp(i Et/h), respectively] this is no longer valid for times t 0. This implies that 
(1) obviously cannot be regarded as a localized electron of spin +s, as it would 
have been in the framework of a single-particle interpretation of the Dirac equation. 
One may wonder why a spin-up electron in time gets admixtures of components 
of negative energy with spin down. The reason for that comes from the fact, that 


[Hf , cr] _ = 2d x p ^ 0 while [H f , p • &] _ =0 


Hence spin is not conserved; only the projection of spin along the momentum p is 
a conserved quantity. 

Let us briefly return to (12), from which we get the relative intensities of the 
partial waves of positive and negative energy: 

R = \b(p,s)\ 2 

b \d*(p',s)\ 2 + | d*(p', -s)\ 2 + \b(p,s)\ 2 

1 1 

p 2 c 2 /(E + nioc 2 ) 2 + 1 (E — moC 2 )/(E + m 0 c 2 ) + 1 

n - hi ti - f- 1 E 

~ : = — — with n = =■ , 

n — 1 + n + 1 2 n m 0 c z 


Rd - 


\d*{p>,s)\ 2 + \d*(p>,-s)\ 2 
\d*(p',s)\ 2 + | d*(p>, -s)| 2 + | b(p,s)\ : 


= 1 


n + 1 n — 1 


2 n 


2 n 


(15) 


Exercise 8.5. 



196 


8. Wave Packets of Plane Dirac Waves 


Exercise 8.5. 


Because of the Gaussian distribution g(\p\) [see (7)] only partial waves with \p\ < 
h/d contributee noticeably, so that 


Ep _ y/ (tripe 2 ) 2 + c 2 p 2 

niQC 2 moc 2 


1 + 


P 2 

TTIqC 2 



( 16 ) 


where we have set p — h/d and d c = h/rn^c. In the figure below we recognize that 
only for d < d c — h/m^c, i.e. only if the width of the wave packet is compressed 
to a size of about a Compton wavelength of the electron, do the partial waves of 
negative energies have an appreciable effect. 


Relative intensities of par- 
tial waves of positive ( Rb ) 
and negative (Rj) energy is 
a Gaussian wave packet of 
width d 




9. Dirac Particles in External Fields: 
Examples and Problems 


example 

9.1 Eigenvalue Spectrum of a Dirac Particle 
in a One-Dimensional Square- Well Potential 

We calculate the spectrum of eigenvalues for Dirac particles in a square-well po- 
tential of depth Vo < 0 and width a. 

For that purpose we decompose the real axis into three domains I, II, III: 
l:z<-a/2 ; ll: -a/2 < z < a/2 ; III: z> a/2 . 

The Dirac equation within these domains reads (see Fig. 9.1) 




\ V 


I 

a/2 

11 + a/2 111 







V 0 



I, III: 

(& • pc + f3nioC I, 2 )tl> = Eip , 

II: 

(a • pc + /3m 0 c 2 )i/> = (E - V 0 )V> 

(V 0 < 0) . (1) 

As the relevant coordinate we choose z . Therefore the spinor is only a function 
of z, e.g. ip = ip(z). 

While the wave functions in I and III are just the free solutions of the Dirac 
equation, the solutions in II are obtained from the free solution by the substitution 
E E — V o. Since no spin-flip occurs at the border of the well, we can restrict 
our discussion to solutions with spin up. The energies of the solutions can take all 
allowed values from — oo up to +oo, so that we can describe particles as well as 
antiparticles. 
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The solution of the Dirac equation (1) are thus given by: 


I. 


ipi(z) = Ae ip ' z/h 


1 l x 

P\c 

E + m 0 c 2 

Vo/ 


+ A't~' p ' z/h 


( 1 \ 

0 

Pi c 

E + m 0 c 2 

V 0 / 


II. 


2 E 2 2 ^3 

2~ m 0 C > Z — ~2 ’ 


/ 1 \ 

( 1 \ 


0 


0 

ipn(z) = B t ip2z/h 

p 2 c 

+ B'e- ip2i/h 

~P2C 


E -V o + m 0 c 2 


E -V 0 + m 0 c 2 

\ 0 / 


V 0 J 


( 2 ) 


(3) 


2 (E - V 0 ) 2 2 2 

p 2 = 5 m n V 


III: 


•- • ~2 ~ Z ~ 2 ’ 


V’iii(z) = C e' p,z / h 

1 ^ 
0 

P\C 

+ C' Q~ ipiZ / h 

( 1 \ 
0 

-pic 


E + moc 2 

Vo/ 


E + moc 2 
VO/ 


(4) 


2 E y 2 a 

Pi = ^2 “ m o c , Z > 2 ■ 


At the borders of the well the wave function must be continuous (because of the 
current conservation d,J p = 0). Therefore we get the condition at the boundaries 


7 — — < L‘ 

2 * 


*(-!)-*■(-!) 


Z = +f: 


This means in particular: 


z — - 


2 * 


e ~ip\a/2h _j_ Q \p x a/lh _ g Q -\p 2 a/2h _j_ g/ ^\p 2 a/2h 


(A, 


-ip\a/2h 


— A' Q l P' a / 2h ^ 


p\c 


E + rriQC 2 


— [B Q~ l P^ a / lh — ft' e ip2a/2h^ 


p 2 c 


z — 


V 0 m 0 c 2 


ft Q ipia/2h _|_ gf Q -ip 2 a/2h _ q ^\p x a/2h _j_ £./ Q -ip x a/2h 
(fi Q [ P 2(l / 2h — ft' Q~ip2Ci/2h^ 


p 2 c 


E -V o 4- m 0 c 2 


_ Q ip\a/2h _ Qt ^-\p x a 1 2h^ 


P\C 

E + triQC 2 


(5) 

( 6 ) 
(7) 


( 8 ) 
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If we define 7 to be 

p\c E — V 0 4- m 0 c 2 


7 = 


E + m Q c 2 p 2 c 


{E - m 0 c 2 ) {E — Vo + m 0 c 2 ) 
{E + m 0 c 2 ) (E - Vo - m 0 c 2 ) 


we can write (5-8) in the following matrix form: 

Z±I J(p\-pi)a/2h 7-1 c i(nt+P)a/2h 

7 7 

7-1 ^-i(pi+p2)a/2h 7+1 e i(P2~Pi)a/2h 


B 

B' 


B 

B 1 


C 

C' 


(9) 


( 10 ) 

( 11 ) 


e -i(Pi+P 2 )a/ 2 h 

7 7 

1 /(I _|_ j^ e >(Pi-pi)a/ 2 h (1 _ j^Q~i(pi+pi)a/ 2 h \ 

2 \(1 — -y)e i 9 , i+t’2) a / 2ft (1 _|_ j^ e i(P2-Pi)a/2h J 

and by inserting (11) into (10) we get in summary 

A \=A- 

A' ) 47 

(1 _|_ -y)2 gi(pi ~P 2 )a/h _ (1 _ /y)2 gi(pi+P2)a/ft Q _ ^ 2 ^-ipia/h _ g i piah^ 

_ 7)2( g i/Wfi _ e ip2a/ft) (J 4. ^,)2g-i(pi-p 2 )a/ft _ (1 _ -y) 2 g— i(fi +P2>a/n 

c 

C' 


( 12 ) 


Thus we have two equations with four unknown coefficients A, A', C , C' . The 
normalization condition is a third equation: 

J ip^rpdz = 1 , ( 13 ) 

Therefore one of the four coefficients A, A 1 , C , C' can, in general, be arbitrarily 
chosen. 

Now we can discuss the solutions to different energies. Let us look at Figs. 
9.1,2 where typical behaviour of the large component of the wave function is 
plotted for different energies. In Fig. 9.1 we have assumed | Vo | to be smaller than 
2moC 2 ; therefore we have to consider four energy ranges. 

(1) E > m 0 c 2 : Free electrons are moving from left to right and are scattered by the 
(attractive) potential. If in domain II the width of the potential is an integer multiple 
of the wavelength, there exists a potential resonance, i.e. one gets an especially 
large probability for finding the electron in the well. 

(2) nioc 2 + Vo < E < moc 2 : Here we find bound states with an exponentially 
decreasing probability for finding an electron in domains I and III. 

(3) —moc 2 + Vo < E < —m^c 2 : Incoming positrons (continuum states of negative 
energy) “feel” a repulsive potential (because of their opposite charge - these wave 
functions actually describe positrons, as well shall see in Chap. 12 on hole theory) 
and will be scattered at this potential. Since the probability of finding an electron 
in domain II decreases exponentially, a large proportion of the positrons will be 
reflected at the repulsive potential. (The transmission decreases with a and increases 
with |E|.) 

(4) E < m 0 c 2 4- Vo: The positrons are scattered at the repulsive potential; again, 
there can exist potential resonances if a is an integer multiple of the wavelength 
in the domain II. 


Example 9.1. 
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On the other hand in Fig. 9.2 the potential is assumed to be | Vq| > Im^c 2 . In 
this case an additional energy domain appears, showing a new behaviour of the 
wave function: m$c 2 -f - Vq < E < -m 0 c 2 . In this domain, bound electronic states 
are possible; however their wave functions do not decrease exponentially, in the 
domains I, III, but they join a continuum wave of the same energy E < ~m 0 c 2 . 
Therefore the probability of finding the particle far away from the well is not equal 
to zero. We interpret this observation using hole theory (see Chap. 12). It means 
that a hole in that state will travel away (as a positron) from the well to infinity, 
and if a state is unoccupied, it will be slowly occupied by an electron of the filled 
Dirac sea. Thus a spontaneous creation of an electron-positron pair is possible. 
An empty bound state will spontaneously be occupied by an electron from the 
negative continuum, whereas the positron will move away to infinity. (Because of 
the spin degeneracy of the state there will even be two positrons: one with spin up 
and one with spin down.) One usually says that in this case the potential well is 
overcritical with respect to spontaneous e~ — c + pair creation. While bound states 
in the energy domain — niQC 2 < E < m 0 c 2 can remain empty without causing any 
instability to the system, it is impossible to keep bound states that “dive” into the 
negative continuum empty for a long time. They will be spontaneously filled up, 
i.e. the hole in this state has a finite decay width. We will see that, in fact, no 
sharp energy levels of bound states exist in the domain mqc 2 + Vq < E < —m§c 2 
(as was the case for —moc 2 < E < moc 2 ), but these wave functions will have a 
reasonating structure that peaks around the expected binding energy of the bound 
state. 

Following these qualitative considerations, we shall now show how these state- 
ments result from the solutions (10)— (12) and we distinguish the two cases: 

(a) \E\ > m 0 c z , i.e, p\ is real . 

(b) |£| < moc 2 , i.e. p\ is imaginary . 

Solutions for (a) are in general called scattering states „ solutions for (b) are bound 
states. 
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Fig. 9.2. The large compo- 
nent of some wave functions 
for different energies. In this 
case the depth is |Vo| > 
2moc 2 



Fig. 9.3. The graphical solu- 
tion of (19) 


First we consider case (b). Here (12) are significantly simplified since A and C' 
have to vanish, so that and 't/’ni do not increase exponentially and are therefore 
normalizable; thus the first of (12) has the form: 


_L jPl “/*■€ ((1 + 7) 2 Q~'Pl a / h 


(1 


7 )2 Q <Pia 


/ft) 


(14) 


Since C 0 (otherwise the whole wave function will be zero), we obtain 

1 7 e -ip2 fl /ft = ^ ~ ^ e +'P2«/ft 

1—7 1+7 


05 ) 
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As long as p 2 is real (in Fig. 9.1 for m 0 c 2 + V 0 < E < m 0 c 2 , in Fig. 9.2 in the 
whole domain -m 0 c 2 < E < moc 2 ), 7 is imaginary (since p\ is imaginary). Thus 
(15) means that 

1 7 e -ip2a/h 

[l — 7 

whereby denotes complex conjugation, or 


1 + 7 
1 - 7 


-\p2a/h 


(16) 


Im f Q-iP2a/h 

\1 -7 


= 0 


(17) 


With 7 = ir (r € R) follows 


2 r 

1 -r 2 



(18) 


Inserting r form (9), we finally have 
a \ 

=f(P2) with 

EV 0 

f(P2) = KC , (19) 

CK 

whereby p\ = in. This is the equation to determine the energy eigenvalues of the 
bound states, which will be discussed immediately. Before this, one should briefly 
mention that (15) has no solution, if in Fig. 9.1 

- moc 2 < E < moc 2 + Vq , 


CP 2 cot [P 2 


i.e. if p 2 is imaginary: 

|l±^!e 2 ^/^i (p2 = iK2) ; (20) 

because (1 + -y) 2 > 1 , (1 — 7) 2 < 1 , e 2K2a//s > 1 , which is the reason why 
the wave functions in this domain vanish identically. The condition (19) can be 
approximately solved graphically (take a look at Fig. 9.3); this is sufficient to get 
a qualitative overview about the behaviour of bound states. Let us do this first; 
afterwards we shall obtain the exact energy spectrum for bound states by numerical 
solution of (19). 

First we note that 


/ a\ he / a\ ( a\ 

; = CP2 c ° t ( K -) = -(„-) c °t( K -) 


P 2“*0 


he 

a 


m 0 c 2 Vo 

-2m 0 c 2 V 0 - V 2 

f(p 2 ) - » +00 (corresponding to £ — > -\-niQC 2 ) . (21) 

C P 2 ' \/Vg - 2moc 2 V 0 

In the case of Vo < 2niQC 2 , f(p 2 — > 0) becomes imaginary. This looks, if graph- 
ically presented, somewhat like like Fig. 9.3. We note immediately, that a bound 
state always exists, independent of the depth and width of the potential well. This is 


JKP2)' 


P2- 
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in accordance with the corresponding nonrelativistic problem, but is opposite to Example 9.1. 

the corresponding three-dimensional problem. Because of the angular momentum 
barrier occuring in the latter case, not every three-dimensional potential well has 
a bound state, but only those below a certain depth V 0 . 

One can solve (19) numerically and thus determine the energy spectrum of the 
bound solutions for several sets of parameters V 0 , a. This is depicted in Fig. 9.4 
for the case of a = 10X e . We note that, with increasing Vo, more and more states 
appear. For Vo — -2.04moc 2 the well becomes supercritical, the lowest bound 
states enter the lower continuum and can there be realized as a resonance in the 
transmission coefficient (s. b.). 



Fig. 9.4. Eigenvalue spectrum 
of bound electrons in a one- 
dimensional potential well of 
width a = 10X e . The ener- 
gies of the dived states cor- 
responding to resonances are 
depicted by dashed lines , as 
they can be extracted from 
the maxima of the transmis- 
sion coefficients (see Fig. 9.5) 


Now we want to consider the scattering states. Again there are several domains: 

1. p 2 and 7 are real. This is the case for E > moc 2 and for E < — m 0 c 2 4- Vo. In 
the overcritical case there is an additional domain m 0 c 2 + Vo < E < —moc 2 . 

2. p 2 and 7 are imaginary. Obviously this is the case for Vo — moc 2 < E < 
Vo + m 0 c 2 . 

We will discuss both cases successively. First we make use of the possibility that 
we can choose one of the coefficients A, A 1 , C, C' freely. We assume that from 
the rhs no wave enters the potential; thus C' = 0 and C is interpretable as that 
part resulting from a wave with amplitude A which arrives from the left, travelling 
through the potential pocket or well. The term proportional to A' stems from the 
wave reflected at the potential. Now we can define a transmission coefficient T 
and a phase shift 6 by 

^ = Vfe- iS , ( 22 ) 

A 

i.e. the amplitude of the outcoming wave is reduced by a factor y/T and shifted by 
the phase 6 compared to the wave impacting from the left. From (12) we obtain 
for real p 2 ' 
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The phase follows from [see (12)] 


-L=e‘«e-<^=c„ 1 (p 2 |)- i i±f si „(p4) 


and 


1 

vf 


[ isin ( 4 -x) +cos (*-xI 


( 23 ) 


(24) 


(25) 


The separation of real and imaginary parts and the elimination of \ff yields 



If, however, p 2 is imaginary, then instead of (23) we get from (12) the transmission 
coefficient 



(whereby 7 = i J 1 and p 2 = in 2 as before), and instead of (27) the phase shift reads 



In Fig. 9.5 the transmission coefficient for a potential of the depth Vo = -3m 0 c 2 
and the width a = 300h/moc is depicted. Now we choose an overcritical potential, 
since the undercritical case differs only by the omission of the domain /n 0 c 2 + Vo < 
E < —moc 2 . We have significant structures of resonance in the electron continuum 
for E > moc 2 and in the positron continuum for positron energies above the 
potential barrier |£| > |V 0 - m 0 c 2 1; (E < -m 0 c 2 + V 0 ). Positrons with lower 
kinetic energy (— moc 2 + Vo < E < m§c 2 + Vo) only penetrate the barrier with a 
probability which decreases exponentially with the width a of the barrier. Hence T 
is about zero in our case. In the domain m 0 c 2 + V 0 < E < —moc 2 , however, there 
is the possibility that the incoming wave meets an overcritical, quasi-bound state, 
and thus penetrates the potential domain more or less unhindered. At the point 
where by extrapolation of the spectrum of the bound states in Fig. 9.4 one would 
expect the quasi-bound state, T is equal to 1. The dived bound state in this way 
becomes perceptible as a resonance in the scattering spectrum below E = —moc 2 . 
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Fig. 9.5. The transmission co- 
efficient for scattered states 
of a one-dimensional square- 
well potential of depth Vo = 
-3moc 2 and width a = 10X e . 
The energies of the bound 
states are depicted by dashed 
lines 


These resonances do not exist for subcritical potentials; their interpretation as a 
signature for spontaneous pair creation has already been discussed in the qualitative 
discussion above. 1 

Finally we look at the scattering phase in Fig. 9.6. As can already be seen from 
(23), (27), (28), (29), one gets T — 1 for 6 -p\(a/h) = 0(mod7r), and T becomes 
minimal for 6 — p\(a/h) — ^(mod7r). If T is minimal the reflection coefficient 
becomes maximal. Hence this statement is in agreement with the well-known state- 
ment from scattering theory that the scattering cross-section becomes maximal if 
the scattering phase passes through (“Scattering” in the one-dimensional case 
is identified with reflection at the potential well.) 

Thus in the domain m 0 c 2 + Vo < E < - m 0 c 2 we can determine the energies of 
the dived states, where (<5 — p\a/h) = 0(mod7r). These energies lie exactly where 
one would expect them by extrapolation of the bound spectrum (see Fig. 9.4). 



Fig. 9.6. The phase shifts of 
scattering states of a one-di- 
mensional well of depth Vo = 
-3moc 2 and with width a = 
10X e as a function of energy 
E. The energy E is given in 
units of moc 2 


1 For greater detail see W. Greiner, B. Muller, J. Rafelski: Quantum Electrodynamics of 

Strong Fields (Springer, Berlin, Heidelberg, New York 1985). 
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EXERCISE 


9.2 Eigenvalues of the Dirac Equation 

in a One-Dimensional Square Potential Well with Scalar Coupling 


Problem. Investigate the eigenvalues of the Dirac equation in a one-dimensional 
square- well potential, with depth V 0 and width a, if this potential is not coupled 
as a time-like component of a four- vector, but like a scalar. 


Solution. As in Example 9.1 we define three domains I(z < a/2), II(— a/2 < 
z < a /2) and III(z > a/2). In these cases the Dirac equation takes the following 
respective forms: 

I, III : (& • pc 4 - f3moc 2 )ip = E ip , 

II : (a • pc + $ (m 0 c 2 + V 0 ))ip = Eip ( V 0 < 0) . (1) 


In contrast to the vector coupling, which in domain II leads to the replacement E — > 
E - Vo, we now must replace /n 0 c 2 by m 0 c 2 + V 0 in II. While vector coupling acts 
differently on electron and positron states, respectively (if electrons in the square 
well are attracted, positrons are repelled, and vice versa) and thus the eigenvalue 
spectrum is not symmetric (bound states exist for only one of the two kinds of 
particles), scalar coupling acts equally on particles and antiparticles. Alternatively, 
one can say: For vector coupling the potential couples to the charge (which is 
different for particles and antiparticles); for scalar coupling the potential couples to 
the mass (which is equal for both particles and antiparticles). In the latter case we 
thus expect a symmetrical energy eigenvalue spectrum, i.e. for both electrons and 
positrons there will exist bound states. Hence we expect a supercritical behaviour 
even for Vo < — m 0 c 2 . In this case, in principle electron and positrons states can 
cross. What happens then will be discussed in detail a little later. Now, though, 
let us proceed analogously to the case of vector coupling. Again the momentum in 
the regions I, II is given by 


2 E 2 2 

Pi = -j - me ■ 

C L 

In domain II, however, we get 


2 E 2 ( , Vb\ 2 

P2 = __Uo C + _j 


( 2 ) 

( 3 ) 


Again we can write down the conditions for continuity of the wave functions 
at z = — a/2 and z = a/2. Analogously to the case of vector coupling, we define 

_ Pie E + m 0 c 2 + V 0 _ j (E - m 0 c 2 )(E + m 0 c 2 + V 0 ) 

^ E + moc 2 p2C y (E + me 2 )(E - me 2 — Vo) 

One should note that 7— > 1/7 if E — > —E. Using the same notation as in the 
case of vector coupling, we can work with (12) from Exercise 9.1, if we observe 
that 7 is now given by (4): 
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x 


A\ 1 


A' J 47 

'(1 4 c Hp\-p 2 )a/h — (l — ^2 Q i(p ] +p 2 ) a /h ^2 — 7 2 )(e~^ 2 “/ n — e i/W^) 

_ ga 2 )(t~ xp2a ^ h — t' Pia / h ) (1 + 7) 2 e _i 0 ?i -/ 7 2)«M — (1 — 7) 2 e -i^i+P2)fl/^ 


x 


C 

C' 


(5) 


Exercise 9.2. 


We divide the further discussion into several steps: 

(1) \E\ < mQC 2 -, bound states 

(2) IE | > moc 2 ; scattering states in the bound electron or positron continuum. 

Let us first consider the bound states. For these the following eigenvalue equation 
holds: 

1 7 e -i|>2a/ft _ ^ ~ 7 & ip2a/h _ (g) 

1 - 7 1+7 

This equation only has solutions if p 2 is real, i.e. |E| > |+moc 2 +Vo| \M > nioc 2 +Vo 
for electrons, E < — (moc 2 + Vo) for positrons]. Thus, it again follows from (6) 
that 


tan 



2 r 

i - r 2 


with 


(7) 


7 = i r (r € R) and 

c p 2 cot(p 2 l)=-(?^+K l c')=:f(pi) . ( 8 ) 

whereby p\ — \K\ . Of course this equation is symmetric in ±E, too. Furthermore 
the following relations hold: 


f(Pi) 


f(Pi) 

f(Pi) 


P 2 - 


\j- 2moC 2 V 0 - V 0 2 - pic 2 


m 0 c 2 V o + V 0 2 
*° y/-2 moC 2 V 0 - V 0 2 


|£|— *m 0 c 2 


On 

cl 

VI 

< V 

/-2m 0 c 2 V 0 - V 0 2 J 

• (9) 

for 

|Vb| 

< moc 2 


for 

|Vb| 

> moc 2 , 

(10) 

for 

V 0 - 

-> —2moc 2 


P2^ J' 

-2m 0 V 0 - V 2 /c 2 ) 

. (11) 


Since Vo < >n<)C 2 , the graphical solution of (9) looks similar to the case of vector 
coupling (see Fig. 9.7), except that one has to imagine Fig. 9.3 in Example 9.1 to 
be continued to negative energies in a symmetric way. 

If now |Vo| increase, the intersection point of f(pi) with the ordinate in Fig. 9.7 
is shifted upwards. Correspondingly the lowest eigenvalue decreases to smaller 
values of pi. In case of 

(m 0 c 2 V o + V 0 2 ) hc_ 

y / -2m 0 c 2 V 0 - V 0 2 a 



208 


9. Dirac Particles in External Fields: Examples and Problems 


Fig. 9.7. Graphical solution 
of (8) 



the lowest eigenvalue vanishes, namely at p 2 = 0 or \E\ — \ m 0 c 2 + Vq| > 0. 
Consequently the deepest bound state never reaches the energy E = 0 and the 
corresponding electron and positron states never overlap. This does however not 
mean that the eigenstate does not exist any longer for larger values of |Vo|- Rather, 
n ow there exists a soluti on of (6) with imaginary momentum p 2 = i/^ 2 , ^2 = 
y/(moC + Vq/c ) 2 + E 2 /c. The transcendental eigenvalue equation analogous to (8) 
reads in this case 

^(*2 a \ ( moc 2 Vo , ~\ z, , 

c/c 2 coth y — j = - I + k\c I — :/(k 2 ) • 

A graphical construction similar to the one in Fig. 9.1 reveals that there is always 
exactly one solution for 

moc 2 Vo + Vq 2 ^ he 

y/~ 2m 0 c 2 V 0 - V 0 2 a 

For Vo 00 it monotonically decreasingly approaches the value — ► — m 0 c — 

Vq/c corresponding to E > ±0. 

If we further increase |Vq|, the ordinate cut of f{p 2 ) still moves upwards and the 
higher states approach the eigenvalues p 2 — rnrh/a with the eigenenergies E 2 = 
(nir /a) 2 (hc) 2 + (tyiqc 2 + Vq) 2 . For \Vq\ — > IniQC 2 , the quantity f(p 2 = 0) diverges 
and the eigenenergies come close to the value E 2 = (hit / a) 2 (he) 2 -Vm^c 4 > m^c 4 . 
That means no more bound states exist for \ Vq\ — » 2niQC 2 \ with increasing potential 
depth all bound states vanish, one after another. This behaviour is illustrated in 
Fig. 9.8, which was obtained by solving (8) numerically. The energy diagram shown 
is characteristic for the square-well potential. [One can also consider the same 
problem in three dimensions with a Coulomb-like potential of the form —a'/r 
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Fig. 9.8. Spectrum of eigen- 
values of the Dirac equa- 
tion with a one-dimensional 
square-well potential of 
width a = 10X e as a func- 
tion of the potential strength 
Vo (scalar coupling) 


(see Example 9.8); then, all states in the diagram are conserved for arbitrary high 
coupling strength a', too. However, their binding energies approach the value 
\E\ = 0 only asymptotically (cf. Fig. 9.9) and also in this case electron and positron 
states do not cross.] 



Fig. 9.9. Eigenvalue spec- 
trum of the three-dimensional 
Dirac equation with a /r po- 
tential (scalar coupling) 


Therefore we find, in contrast to vector coupling, that in the case of scalar 
coupling spontaneous e+ — e~ pair creation never occurs, no matter how strong 
the potential chosen. This qualitatively different behaviour of the bound states 
in case of the a! /r potential is easily understood in the following way: Due to 
the scalar potential the electrons obtain an effictive mass m e ffC 2 = rriQC 2 + Vo. 
Figure 9.10 schematically shows this effective mass as a function of r for the a' jr 
potential, together with some bound states. 

One sees that a region with m e ff < thq always exists, so that bound states are 
always possible (i.e. for all values of the coupling strength a')- With increasing 
parameter a ' the wave functions are shifted to larger values of r. Simultaneously 
the potential bag is broadened and hence the energy eigenvalues \E\ decrease and 
become zero in the limit a' — > oo. Since ra e ff > 0 always holds, then \E\ > 0 
is always valid, too, and electron and positron states can never cross. The energy 
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Fig. 9.10. Square of the ef- 
fective mass (/n e ffC 2 ) 2 = 
\»ii)C 2 4- V (r)] 2 as a function 
of the radius r 



gap between electrons and positrons becomes smaller and smaller, but never zero; 
therefore, e + -e~ pair creation without energy (i.e. spontaneous pair creation) can 
never occur (see Figs. 9.8 and 9.9). 

Finally we give a brief qualitative discussion of the scattering states |£| > 
m 0 c 2 . Again we consider the case C = 0 and study the transmission coefficient 
T — ; C j A | 2 . Of course, the relation \E | > moc 2 + Vo also holds in the energy region 
|£| > m 0 c 2 . For V 0 < -2 m 0 c 2 the electrons feel a potential barrier if they lie in 
the energy interval m 0 c 2 < E < \m 0 c 2 + V 0 \ (similar statements hold for positrons). 
While for |Vb| < 2nuyc 2 resonances occur in both continua (in complete analogy to 
Fig. 9.5 in Example 9.1), in the region E > moc 2 one obtains for |V 0 | < 2moc 2 an 
additional energy region, where electrons and positrons feel a potential barrier, and 
the transmission coefficient becomes very small (corresponding to the behaviour 
in the energy interval - m 0 c 2 + V 0 < E < m 0 c 2 + V 0 in Fig. 9.5). 


EXAMPLE 


9.3 Separation of the Variables for the Dirac Equation with Central Potential 
(Minimally Coupled) 


The Dirac-Hamiltonian in this case reads 


H D = ca • p + fimoc 2 + V (r) , ( 1 ) 

where V (r) = Mo(r). Because of the spherical symmetry of the field, 2 the angular- 
momentum operator J and the parity operator P = q' v (3(x — > -x) = e iv, 7°P 0 
[see (4.9)] with respect to the origin of the coordinate system commute with the 
Hamiltionian. Hence states with definite energy, angular momentum and parity 
occur. The corresponding wave functions are denoted by 


| V)7m(®>0 

\ X/7'm(®) t ) 


( 2 ) 


Here tpj im and Xji'm are two-spinors which are to be determined. Snce ip ]m must 
have good parity and the parity operator reads P = e iv pP 0 (Pq changes x into 
—a;), we have 


2 See W. Greiner, B. Muller: Quantum Mechanics - Symmetries, 2nd ed. (Springer, Berlin, 
Heidelber 1994). 
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^jm > O — i O Or Pl/jj m (x ) — X'lpjmix ) , 




J m 


— 


— ^ 


0 j p ( 'Ipjlmi&it} 
-I/ °VX;7'm(*,0 
f l Pjlm(3'i 0 ^ 

\ Xy7'm(® j 0 / 


gi<^ | f > O i Pjlm(.'K > 0 
\ ^0 Xjl'mip^^ 0 


(3a) 


(3b) 


where |A| - 1 [in (3b) we have replaced a;' by x\. One should clearly recognize 
the content of (3 a): the demand for “good parity” of the wave function means 
that the wave function is an eigenfunction of the parity operator P [see (4.9)]. 
Equation (3b) shows that the parity of the two-spinor </?;/„, must be equal to the 
negative parity of Xji'm- We can also understand this statement in the following 
way: Starting with the stationary Dirac equation Hoip — Exp, we get with 


a — 


0 <r 
<x 0 


Example 9.3. 


that 

c * P) X + moc 2 (p + Vp = E<p 

c - p) <p - m 0 c 2 x + Vx = Ex 

or 

(E - m 0 c 2 -V)(fi = c{&’p)x 
{E + m 0 c 2 - V)x = c(&-p)<p 


(4) 

(5) 


Since the operator & • p changes parity, these equations show that the two spinors 
ip and x must have opposite parity. 

Eigenfunctions of the angular momentum and the parity operator are the well- 
known spherical spinors. To avoid confusion with the complete wave function IF, 
we shall denote the spherical spinors here by f2ji „ t . They are defined by 


fym = ( l {j\ m ' m s m ) Y lm'Xi ms 

m f ,m s 


(6) 


Here the two-spinors X^ m$ are eigenfunctions of the spin operators S 2 = h 2 a 2 /4 
and S 3 — they read explicitly: 

»>i-G) ■ »h-0 

The parity of Qj\ m is given by Yi,,,: 

Pofylm = (-1 ) l n jlm . (7) 

We make the following ansatz 

^jlm ~ i ’ 

Xji'm = -f{r)Ojv m (y) ( 8 ) 


with 
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(2(l + \)-l = l + \ for j =1 + 2 
l' — 2j — l =: l V 2 ' 2 . (9) 

\2{l -\) - l =1 -\ for j = l - i 

Let us repeat this once more: Because of (6) either j I — j or j =1 — 2 If 
j = l + 5, the orbital angular momentum l' of Xn'm is /' = / + 1. This is the only 
way to realize the opposite parity of % compared to ip. The value V — l — \ must 
be excluded, because no total angular momentum j — l + \ can be constructed by 
l' — l — l and S = The arguments follow a similar pattern for the second case 
of (9). One has 

<3- • ppjim = 6- -p (ig(r){2ji m = (<r • pig(r)) £2 jlm + i g{r)& • pQj im 

= ' 7) + i g(r)a- • pftjim . (10) 


With respect to (6) the spherical spinors are eigenfunctions of the operators L 2 , J 2 
and S' 2 = ( (<t ) 2 with eigenvalues /'(/'+ 1 )/r. /(/’ + I )/r and ^ Ir respectively. To 
be complete we once more give the explicit form of the for the useful cases 
j = l + \ and j = l- \ (J > |): 


/ Ij+rn 


^ I+i ,l,m — 


V 

2 j 


J — m 


\ V 2 7 / ’"'+5 / 




/ j j - m + 1 

2;+2 




j +m + \ 


V V 2;+2 , > m+ 2 / 


(11a) 


(lib) 


The root factors (11) are the Clebsch-Gordon coefficients in explicit form. Now 
we make use of the following relation between the spherical spinors 



( 12 ) 


which is easily proven, because (<x • r/r) is a scalar operator of negative parity 
[see (10.54) and the following]. With (12) we get 

-(<7 • p)Qj !m =(&>p)(&-^ n jl!m . (13) 

Now we take the already familiar relation 


(<x • A)(a- • B) — A • B + i<r • {A x B) (14) 

to change (13) into 

-{a • p)f2 jlm = (p • ^ + i<r • (p x Q jVm . 

With p = -i/) V and L = r x p, (15) can further be transformed into 


( 15 ) 
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(p • r + i<r • (p x r )) - 




= ( — ift(V • r) — ifir • V — i& * (r x p )) 

= (-i/y - iftr (-y) - i^) O jVm 
.(2 h \* A „ 


From 

J 2 = 4- = L 2 + + h& • L 

follows 

hL • &O jVm = |j 2 - i 2 - j Oji> m 

= {/(/ + 1 ) - /'(/' + 1 ) - 1 } M jrm 


(16) 

(17) 


(18) 


Now and in the following it is convenient to define a quantum number k, by 


* = T (J + 




(/ + 1 ) 


for j = l + \ 
for j = l - \ 


(19a) 


Obviously there is always 

M =7 + 5 or J = M - \ • (19b) 

With this and taking l' — 2j — 1 into account one can rewrite the expectation value 
on the rhs of (18). For j — l + ^ one gets 

(/ + I)(/ + I + l)-[2(/ + I)-/] [2(Z + I)-Z + 1]-| 

— / 2 4" 4* / 4" 4* ^ 4- ^ — (/ 4* 1)(Z 4* 2) — ^ 

= / 2 + 2/ - / 2 - 2/ - l - 2 = -(/ + 2) = k - 1 . (20a) 

Similarly for j = l — 

= = = . (20b) 

Now (16) can be written as 

(2 h + L- &)O jVm = (1 + K)M2 jVm . (21) 

If in (18) we had started with Oji m we would have obtained (2 h 4- L • (r)Qji m = 
(1 - K)h{lji m by the same procedure. In the literature 3 the following notation is 
often used for (21): One writes 




= On 


•jlm 


X — K,m — 


<]Vm 


(21a) 


Example 9.3. 


3 See, for example, M.E. Rose: Relativistic Electron Theory (Wiley, New York, London). 
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and defines the operator 

k = h + L • & , 

so that due to ( 21 ) the eigenvalue equation 

~ ) kX — K,m — foftX — Kjm 

holds, where 

-(/ + 1 ) = -(/' + \) for ;=/ + ! 


( 21 b) 


( 21 c) 


K = 


/ — + (/ + !) for j — l 


and |k| = j + l 


Consequently, the spherical spinors of ( 8 ) can also be denoted by Xn,m — fyim and 
X-K,m = fyi'm and we can therefore write for the four-spinor in a central field 


= 


_ ( <Pjim(x,t ) \ _ f ig(r)fyi m ( 7 ) 


_ ( i gi.r)XK,m 

-f(r)X-K., 


(7) 

= , i ( g(r)x.,m 

\\ f(r)x-K,m 


Sometimes we shall use this alternative notation. With this and (15) and (12), 
equation ( 10 ) finally takes the form 


- - r> fk^g , K+ 1 t , , 

o' • ptfjim — vtjVm I "b f^gir ) 

Analogously one derives an expression for ( a • p)\, namely 


(<7 • p)Xjl'm — il^/7m I ^ 


; d/(r) k -1 

dr r 


frf(r) 


(22) 


(23) 


Now the expression (22) and (23) are inserted into (5). The angular functions from 
both sides of the equation can be eliminated. So we obtain the differential equations 
for the radial functions / and g: 

^ C ~dr~^ + (1 + *)hc^p- — \_E+ m 0 c 2 - V(r)\f(r) = 0 , 

+ (1 - K)hc f -±p- + [E - m 0 c 2 - V (r)] g(r ) = 0 . (24) 

With the substitution 


G = rg and F — rf with 


d G d g(r) 

d7 = 9+r ^r 


1 d F „ d f(r) 
and — +f + r — — 
dr dr 


one finally gets 
dG(r) 


hc- 


hc 


dr 

d F(r) 
dr 


+ hc—G{r) — [E + m 0 c 2 — V (r)] F(r ) = 0 


- hc—F(r) + [E - moc 2 - V(r)] G(r) - 0 


(25) 


(26) 


These are the frequently used coupled differential equations for the radial wave 
functions F and G of the Dirac equation in the case of a spherically symmetric 
potential V(r). In the literature one occasionally sets G — u\ and F = u 2 . 
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EXERCISE 


9.4 Commutation of the Total Angular Momentum Operator 
with the Hamiltonian in a Spherically Symmetric Potential 


Problem. Show that the operator of the total angular momentum J commutes with 
the Hamiltonian of a Dirac particle in a spherically symmetric potential. 


Solution. In analogy to the Pauli theory for electrons with spin we define the total 
angular momentum as the sum of the orbital angular momentum L and the spin 
angular momentum with 


SJ = 


a 0 

0 <T 


( 1 ) 


J = L+\ht! = L + S . (2) 

The one-particle Dirac Hamiltonian for a spherically symmetric potential Ao(r) 
which is minimally coupled as the fourth component of a four-potential reads 

H d = ca • p + j3m 0 c 2 + - A 0 (r ) . (3) 

c 

First we investigate the commutator [ L,Hd ]_. Since L commutes with 3 and the 
spherical symmetric potential A 0 (r), it only remains to calculate [L, a • p]_. We 
restrict ourselves to the calculation of the commutator for the L x component and 
get: 


[L x ,a-p]_ = L x (a x , p x + & y p y + a z p z ) - (a x p x + a y p y + a z p z ) L x , (4) 

L x = yp z - zp y . (5) 

Since L x commutes with p x , it follows from (4) that 

\L x , oi • pj _ = oty | L x , p y j _ + Oi z \L X ? Pz ] _ 

= Oty [yPzPy -ZPyPy ~ P y yPz + PyZPy) 

+ (yPzPz - ZPyPz - Pz.yPz 4 -PzZPy) 

= OLy (yPzPy - PyyPz) + ( PzZPy - Zp y p z ) 

= &y ( yPzPy - (p y y)Pz ~ ypypz) 

+ &Z (( PzZ)Py + ZPzPy - Zpypz ) 


= &y ( ~P y y )Pz +&z(PzZ )py ■ 

(6) 

With pi — i hd/dxj we then have 


[L x , a • p] _ = ih {dcypz - a z p y ) , 

(7) 

and generally 


[Li , & • p] _ = i h ( ajpk - &kPj ) - 
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By cyclical permutations of the indices the final result is 


[L,d-p]_ =i h(a xp)± 0 . (8) 

Equation ( 8 ) states that the orbital angular momentum L does not commute 
with the Hamiltonian and hence it is not a constant of motion. Now we investigate 
the commutator of spin angular momentum with the Hamiltonian: [hE /2,H^}-. 
In this case we restrict ourselves to the component [HE X /2]_, too. It is convenient 
to introduce the 4x4 matrix 7 ^: 

75 with 7s 2 = 1 • (9) 


The following relations hold: 


f 

-7 5 ° = 


0 n 

b 0 


0 a 
a 0 


a 0 
0 a 


= E 


( 10 ) 


A / 

-<*75 


-i 5 E = 

, 

-^75 = 
PE = 

Ep = 


0 -a 
-a 0 

0 11 
11 0 


a 0 
0 -a 


n 0 
0 -n 

a 0 
0 a 


0 -11 

-n 0 


0 a 
a 0 


0 

-n 


a 0 
0 a 

= & 

0 a 
a 0 


<7 

0 

a 

0 


0 

a 

0 

a 


— OL 


and therefore 

[&£]_= 0 , 

[ 75 ,q :]_=0 and 

[ 7 s ,£]_=0 . 


( 11 ) 

( 12 ) 

(13) 

(14) 

(15) 

(16a) 

(16b) 

(16c) 


Thus there remains only the calculation of the following commutator: 
\h{Z x ,dt'P]_ ^ ~\h^ 5 [a x ,a.p] 

due to ( 16 b) 

= -^75 (<** { & xPx + &yPy + OtzPz] - {&xPx + &yPy + &zPz} &x) 

= -^75 {a x a y p y + & x a z p z - &ypy& x - &zPz& X } 

= - 5^75 { (®x&y - a y a x ) p y + ( a x a z - a z a x )p z } . ( 17 ) 

Because of the commutation relations of the a matrices, 


™x 7 &x ™z ™z 


(18) 


and it follows that 
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2 ^ \_^x , OL • p] _ {OLyQL x py + & z & x Pz } 

= 7sM -7^ (-75 ^x) Py - 7 s4 (-75^) Pz } 

73 A ~b ^'z^'xPz} ■ (19) 

due to ( 16 c) 

Now the commutation relations of the Pauli matrices are 

®x®y — —O’yCJ x = i& z , 

®y®z — ®z®y — ? 

a z a x = ~Vx°z = i<5-y • (20) 

Hence one gets 

2 ?i [27* , o: • p] _ /fYs ( 127 z py + i2yj,p z ) i^l (oi z py QyPz) . (21) 

Comparing (21) with (7) we recognize that the component J x of total angular 

momentum indeed commutes with the Hamiltonian. Cyclical permutation of the 
indices results in 

[J,// D ]_ = [J,6c-p]_ -0 . (22) 

We know 4 that J 2 and any component of J, for example J z , can be simultaneously 

A A A A 

diagonalised with H. The fact that J, but not L and S, provide good quantum 

numbers in the Dirac theory (i.e. that J commutes with Hd) is due to the spin- 
orbit coupling ~ L • S contained in H D (see Chap. 11, the Foldy-Wouthuysen 
tranformation). It causes a coupling between spin and orbital angular momentum, 
and hence only the total angular momentum is a constant of motion. 


EXERCISE 


9.5 A Dirac Particle in a Spherical Potential Box 


Problem. Establish the solutions of the Dirac equation 

(a • p + j3moC 2 )ip(r ) = [E - V(r)] tp(r) 

in a spherical square-well potential 

V(r) f-^o<0 for r < R 
l 0 for r > R 


( 1 ) 


( 2 ) 


4 This is covered in detail in W. Greiner: Quantum Mechanics -An Introduction, 3rd ed. and 
in W. Greiner: Quantum Mechanics - Symmetries, 2nd ed. (Springer, Berlin, Heidelberg 
1994). 
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Solution. First we write the operator & • p of kinetic energy in spherical polar 
coordinates. This is achieved by using the identity 

V = e,'(e r 'V)-e r x(e r xV) = e,'|--~xl, (3) 

Or hr 

with L — -\h(r x V). It follows that 

a-p = -iha r -^- - • (e r x L) . (4) 

Making use of 

(& • A) (& • jB) = A • B + i£ • (A x B) , (5) 

we obtain (with A — e r , B = L) 

a r • (a * L) = e r * L + i£ • (e r x L) = \E • (e r x L) (6) 

Multiplying by 



from the right results in 

a r • (27 • L) = ia • ( e r x L) (7) 


Therefore (4) reads 

a • p = — i ha r — + i — (U - L) = 
or r 


-lOL r 


d 


0 * 


„ +---K 
Or r r 


( 8 ) 


where we have introduced 

k = 0(£-L + h) . (9) 

We obtain the eigensolutions of (1) by a separation ansatz, i.e. 


Tp(r) = 


i , <p) 


where the Xk,h are the eigenfunctions of the angular dependent part: 
(& • L + h) Xk,h = > 

(&-L + H) x~k,h = hnx-K^ , 

JzXk,[i ^PXk,^ > k = (— 1,1, 2, 2, — 3, 3 , . . .) , 


( 10 ) 


( 11 ) 


[see Example 9.3, (21) and also Chap. 10, (10.30-37)]. 

With the help of the results of Example 9.3, two coupled differential equations 
for g(r) and f(r) follow from (1), namely 


[E — V (r) — m 0 c 2 ] g(r ) = he 
[E -V (r) + m 0 c 2 ]f(r ) = he 


d 1\ k 

T + ~ +- 

dr r } r 


m 


d 1 k 

i ' — 

ar r r 


9(r) 


( 12 ) 
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Often it is more convenient to use 
«i (r) = rg(r) , u 2 (r) = rf(r) 

for which the differential equations read: 


/ u\ (r) \ _ — 7 ^ (E + me 2 — V (r)) 

dr\u 2 (r)J _\_^ E _ mc 2 - V( r )) 


K 

r 


x ( u ^ r A 

\u 2 (r)J ' 

For constant values of Vo. (14) has the following solutions: 

( 1 ) If 

h 2 k 2 c 2 = (E + V 0 ) 2 - m%c 4 > 0 
M,(r) = r [a\ji K (kr ) + a 2 y u (kr)) , 
k hekr 


u 2 (r ) = 


|k| E + V o + m 0 c 2 


{a\jt_ K (kr) + ci 2 y,_ K (kr)) 


with 


{ k for K 

— At — 1 for K 

= {~ K 
~ K \k -1 


>0 

<0 


for — k > 0 
for — k < 0 


(ii) If 


h 2 K 2 c 2 = mfc* -(E + V 0 Y > 0 
«i(r) = 
u 2 (r) = 


2 4 


2Kr 


7 r 


(p\Ki K+ ]/ 2 (Kr) + b 2 Ii K+i / 2 (Kr)) , 


HcK 


2Kr 


(13) 


(14) 


(15) 


(16) 


E + V, + m a c *V 7T (- i ’' Sr <-+iM & ) + W I .« + i/ 2 (*rr)) . (17) 

The ji and 37 are the spherical Bessel functions of the first and second kind 5 and 
the K [+ / 12 are the modified spherical Bessel functions. Their asymptotic behaviour 


is: 


jn(.Z) 


1 


y n (z) 


z— »o ( 2 n + 1 )!! 
—(2 n - l)!!z 


z— »o 


— n — 1 


1 


(18) 


^/n + lMz)-^ (2n + 1)!r 




( 2 n - l)!!z~ n-1 


(19) 


The irregular solutions yi are also noted in the literature as spherical Neumann functions 

m. 


Exercise 9.5. 
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7 r 

2 ~ + 1 /2 (^ ) ■ 


7 r 

2^^«+i/2(z)' 


( 20 ) 


Before continuing, we want to give the representation of the <p) in (11) 

[see Example 9.3 and Chap. 10, (10.32)]: 

XkA$’ ( P)= { l *li A* ~ m i m ) (21) 

m = - 1/2, 1/2 

with 


X i i = , n 
2 ’ 2 \0 


*i.-i U 


( 22 ) 


Let us now find the bound states. 6 For these, E > V 0 + m 0 c 2 and -mqc 2 < E < 
mac 2 . In the inner region of the potential field we must therefore take the solutions 
(15) and set <22 = 0, in order that the wave functions remain normalizable at the 
origin. On the other hand in the outer region we must set b 2 = 0 in (17), so that 
the wave functions are normalizable at infinity. Both solutions must be joined at 
r = Rq. One can eliminate the normalizing constants a t , b\ by adjustment of the 
ratio m 1 /m 2 at r = Rq , This gives 


K R oji K (kR 0 ) 
«| kRoji_ K (kRo) 


(E + Vq + mqc 2 ) 


RoK, k+ i(KR 0 ) 

kr 0 k Ik+ ^kr 0 ) 


(E + mac 2 ) 


(23) 


and 


ji K (kRa) _ K k K I k + ±( KR o) E + mqC 2 

ji-S kR 0) M K K,_ k+ , (KRq) E + Va + m 0 c 2 (24) 

with 

hck = ^(E + Vq) 2 - m 2 c 4 , 

hcK = yj m 2 c 4 - E 2 . (25) 


For |«| = 1 one can further simplify the equations analytically, and for 5 states 
(« = —!,/« = 0, /_ K = 1) this results in 


kRg sin kRq _ ( k q~ kr o E 4 - m 0 c 2 

sin kRq - kRq cos kRq ~ + K e~ KR o(\ + 1 /KRq) E + Vq + m 0 c 2 

After some transformations one gets 


tan 


f \/ ( E+V„Y-mlc< 


E + Vp + mqc 2 
E + Vq — mqC 2 


X 


he 

1 

1 

1 mqc 2 — E 

~Ro 

E + Vq + mqc 2 

E + ntoc 2 

V mqc 2 + E 

J 


(26) 


( 27 ) 


W. Pieper, W. Greiner: Z. Phys. 218, 327 (1969). 
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Analogously one obtains for p x / 2 states (k = 1, /* = 1, /_* = 0): 

kRo sin kRo K 1 E + V 0 + moc 2 

sin kRo — kRo cos kRo k 1 + 1 /KRo E + moc 2 


and 


tan 


Yc/ {E + y o) 2 - m 2 c 4 


I E + Vo - m 0 c 2 
E + V 0 + me 2 


x < 


he 

R~o 


1 


+ 


1 


E + V o — moc 2 moc 2 — E 
Another form for (26) is (defining a = kRo) 


+ 


I moc 2 + E 
moc 2 — E 


> = 1 


c* cot 


(moc 2 /hc)R 0 y/l-(E 2 /m 2 c*) + 1 
a 1 1 + E/ttioc 2 


x 


i+ v i+ (^) : 


and for (28) one can write: 


1 — a cot a = 


(m 0 c 2 /hc)Roy/l-(E 2 /m 2 c 4 ) + 1 


1 — E jmoc 2 


x 


(-4+m) ' 


(28) 


(29) 


(30) 


(31) 


From (27) and (29) we can now calculate the energy eigenvalues of si and p\_ 
states. If we assume Rq to be small ( mocRo/h 1), we may solve (27) and (29) 
approximately by expanding in terms of mocRo/h. A short calculation (which is left 
as an exercise for the reader) leads to the following Table 9.1, where n = 1, 2, 3, . . . 
labels the states. Similarly, one can find approximate solutions for (30) and (31) 

Table 9.1. Energy eigenvalues for s\/ 2 and p \/2 states in a small potential box 


E 

Vo(/c= -1) 

V 0 (k= +1) 

me 2 

hrnr 2 

— 3 m Q c 

moc Rq 

h nn o 

— m 0 c 

moc Ro 

0 

H nn 2 

— -m 0 c 

moc Rq 

h mv 2 

— + m 0 c z 

moc Ro 

-moc 2 

h m r 2 

— 4- moc 

moc Ro 

htt-ir 2 

— + imoc 

moc Ro 


Exercise 9.5. 


n — 1,2,3,... ( m 0 cR 0 /h ) < 1 

for the opposite limiting case of a very large potential box (m^cRo/h 1), and 
this is shown in Table 9.2. One sees that the p\_ states are energetically higher than 
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Table 9.2. Energy eigenvalues for j 1/2 and p i/2 states in a large potential box 


Vo(k = -1) 


moc 


—m 0 c 


2 


(w - 1/2) V /i 2 
2mRo 

moc 2 ( 1 + 


n 2 n 2 h 2 \ 
2m o c2/? o / 


2m 0 c 2 ( 1 + 


n 2 n 2 h 2 
4m o c2R o 


n = 1 , 2, 3, . . . ( mocRo/h ) » 1 


Vo(/c = +1) 


n 2/ ir 2 h 2 

2moRl 


moc 2 


2moc 2 


(n + l/2)W \ 

2m 2 c 2 /? 2 / 

, (n + l/2)W^ 

i, + 4m 0 V* 2 ; 


the s states, which can be understood intuitively because of the orbital angular 
momentum l = 1 for the p states. But even for the 5 states (with / = 0), for a 
given R 0 a minimal potential depth Vo is required in order to get at least one bound 
state, in contrast to the one-dimensional problem, where at least one bound state 
always exists. This is due to the fact that for the s state of a Dirac particle in a 
three-dimensional potential well there is an angular momentum barrier due to the 
spin. Indeed, this can be easily seen by decoupling (14), differentiating again with 
constant Vo and reinserting: 

[^ + ro )2 -m„V]-^^±12}j = o , 

/" + { t< £ + ‘'o) 2 - mlc 4 ] - }/ = 0 . (32) 

On the one hand, for s states ( k = —1) the angular momentum barrier is zero 
for the large components. On the other, the equation for / contains an angular 
momentum term, which increases the energy in the three-dimensional case even 
for s states. 

In Fig. 9.11 the eigenvalues 7 [found numerically from (27)] for the Is state in 
potential wells with different values of R 0 have been plotted. One sees that for 
( mocRo/h ) <C 1 as well as for (niocRo/K) 3> 1 the energy eigenvalue £(V 0 ) grows 
almost linerarly with Vo. As in the one-dimensional case, we can determine the 
scattering phase shifts of the continuum. For the s waves this can be done with 
little effort, whereas for the waves with higher angular momentum the matching 
condition at r = Rq cannot be evaluated easily. Let us therefore look at the scat- 
tering phase shifts of the s waves. First we have to match solutions of the interior 
region, 


u\(r) — a\ sin&jr , 



E + Vb — moc 2 / sin&jr 
E + Vq -I- moc 2 \ kir 


cos&jr 


( 33 ) 


at r = Rq to the solution (( hck 0 ) 2 = E 2 — m^c 4 ) of the outside region: 


From J. Rafelski, L. Fulcher, A. Klein: Phys. Rep. 38, 227 (1978). 
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Fig. 9.11. Is eigenvalues as 
a function of the potential 
strength for a spherical po- 
tential well. One example of 
the reults for the 2s level are 
also shown 



w°(r) — b\ sin k$r — cos k 0 r , 


« 2 °(r) = 


Introducing 


I E — mpc 2 
E + nipc 2 
sin k 0 r 


bx 


k 0 r 


— cos k 0 r ) — b 2 ( C0S k° r _|_ sin k 0 r 
k 0 r 


(E + Vo-mpc 2 ) (. E+mpc 2 ) 

7 _ V l E + Vo + m 0 c 2 ) (E - m 0 c 2 ) ’ 


( 34 ) 


( 35 ) 


and making use of the fact that the wave function is continuous at r = Rp, we 
conclude with 


A = k\Rp , A 0 = k 0 Rp 


that 


b\ = a\\ -7— V— ^ cos A + 7 cos A, cos A 
cos A 0 


+ sin A\ — - — - + sin A sin A 


b 2 = a\ 


-7- 


4) 0 

sin A . 


+ sin A\ 


Ax 
sin A 
A 


sin A + 7 sin A cos A 


sin A cos A 


( 36 ) 
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Figure 9.12, 

Phase shift sin 2 (for 5 

waves) of a spherical poten- 
tial box as a function of en- 
ergy 


From the asymptotic behaviour of the outside solution one can derive a phase shift 
M|°(r) ^ > Aa\ sin (k 0 r + 6) , 


u 2 ,( r ) ~z~7ZT >f ^\l cos(fc 0 + 6) 


■ (37) 
Inserting (36) into (34) and comparing this with (37), one derives the equations 

(38) 


cos 6 = 


bx 


sin<S = 


b 2 


Aa\ ’ " ~ Aa\ 

and after some algebraic rearrangements the result 
( 1 7 

6 = - A 0 + arccot I — — + 7 cot A t 

\ ^0 


—k 0 Ro + arccot 
1 


x 


„ _ „ he E + mne 2 

7 cot(kiRo ) + — i / — « 

«o V E — m 0 c z 


1 


E + moc 2 E + V 0 4- m 0 c 2 


(39) 


As in the one-dimensional case the term k 0 Ro is the same for all states and therefore 
this term is usually absorbed into the definition of the phase shift: 

= 6 + koRo 


( ,, „ . he E + mne 2 

= arccot 7 cot(fcj/?o) + —\ , 

y Ro V E - m 0 c 2 

1 1 
E + moc 2 E + Vo + moc 2 

In Fig. 9.12 the phase shift of the s states for a potential of depth V 0 = 3 m 0 c 2 and 
radius Rq = 10A e has been depicted. Once again the zeros of and sin 3 6' 

correspond to resonances; also resonances in the range moc 2 — Vq < E < — m 0 c 2 
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of “dived” s states in a supercritical potential appear. A comparison with Fig. 9. 1 1 
shows that the positions of the resonances are just where one would expect them 
to be by extrapolating the curve E(V o) of the binding energies. 


EXERCISE 


9.6 Solution of the Radial Equations 

for a Dirac Particle in a Coulomb Potential 


Problem. Solve the coupled radial equations for a Dirac particle in a Coulomb 
potential and determine the energy eigenvalues for the bound states. 


Solution. The Coulomb interaction energy of a point nucleus and a particle of 
charge — e is V = —Ze 2 /r, so that the radial equations (cf. Example 9.3) for a 
Dirac particle read 


dG 
dr 
d F_ 
dr 


-G + 


E + moc 2 Za 
he r 


-F - 


rtiQC 


he 


Za 

H 

r 


F(r) 

J 

G(r) , 


( 1 ) 


where a = e 2 /hc « 1/137 is the fine-structure constant. First we shall examine 
the solution of the equations (1) for small r, i.e. near the origin (r ~ 0). In this 
case the terms with E ± »iqc 2 can be omitted and one gets 


dG 


Za r( x 

dr 

H — G — 

— Fix) = 

r 

r 

d F 

K „ 



H — F + 

—G(r) = 


dr 


( 2 ) 


Using a power series expansion as an ansatz for the solution of (2), the first term 
of this series dominates in the region near the origin. This motivates the ansatz 
G — ar y and F = br y . Using this it follows that 

ajr y ~ l + Kar 7 " 1 - Zabr y ~ 1 = 0 , 

bjr y ~' + Kbr y ~ x + Zaar y ~ x = 0 (3) 


or 


a( 7 + k) — bZa = 0 , aZa + b( 7 — k) = 0 . (4) 

The determinant of the coefficients must vanish, yielding 
1 2 = K 2 -(Za) 2 , 

7 = iv ^ 2 - (Za ) 2 = ±yj (j + \f - Z 2 a 2 . (5) 

Since the wave function has to be normalizable we must choose the positive sign 
for 7 . For the negative solution 7 = — I 7 I it follows that F 2 + G 2 ~ r~ 2 ^ near 
r = 0, which would yield a divergent integral for the norm if ( 7 ) < However, 


Exercise 9.5. 
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we should mention that for k 2 = 1 and ZaV 3/2 or Z > 118, regular solutions 
with negative 7 seem possible. Indeed , to have F 2 + G 2 ~ r ~ 2 \/\-iZa) 2 st j|j 
integrable for r -» 0, the inequality 2^/1 - (Z «) 2 < 1 should be fulfilled, which 
leads to ( Za ) 2 > In Exercise 9.12 we shall show that these “solutions” do, in 
fact, not exist. Here we only give a plausibility argument: these solutions must be 
discarded because of the following postulates: Not only the normalization integral 
/ 'ijF H D -ib d 3 r must exist, but also the expectation value of each partial operator 
within H d , especially the expectation value of the Coulomb energy: 

yV = J (F 2 + G 2 ) (-^ 7 -) dr . 

Note that the usual factor r 2 from the volume element, according to (25) of Example 
9.3, is already contained in ( F 2 + G 2 ). The integrand behaves like (r +2y )/r dr — 
r +2 7-1 dr in the limit r — ► 0, yielding a finite contribution only if 27 - 1 > - 1 , 
i.e. 7 > 0. This, on the other hand, means that only the positive root in (5) is 
physically meaningful. Furthermore one can conclude from relation (5) that for 
states with; + \ = 1 only solutions up to Z = a~ l ~ 137 can be constructed. 
For larger values of Z the root becomes imaginary and the wave functions are no 
longer normalizable. For (Za) 2 > « 2 , in general the real part of the wave function 
shows an oscillatory behaviour of the form Re(F, G) ~ cos(| 7 | lnr) for small r. 
We shall deal with this strange situation later when we treat electrons in the fields 
of extended nuclei and discuss the supercritical vacuum. In order to solve (1) we 
make the following substitutions: 


g = 2Xr with A = 


(wqC 4 — E 2 ) 

he 


2\V2 


( 6 ) 


With dg/dr = 2A and d/d r = 2 A d/dg and dividing by 27 it follows that 
dG(e) kG(q) 


d g 

d F(g) 
d g 


+ 


E + moc 2 Za 
2A he + V. 


F(Q) 


E — m^c 2 Za 
2 A he + — 


K , 


G(e) + -F(g) 

7 


( 7 ) 


Using this form of the equations one can get the behaviour of F(g) and G(g) for 
g -* 00 , since neglecting the terms proportional to 1 j g the differential equations 
(7) read 


dG(£>) E + moc 2 

d e 

d F(g) 


2hc\ 

E — m^c 2 


F(g) 

G(g) 


d^> 2hcX 

Combined with ( 6 ) it follows immediately that 
d 2 G(f?) 


( E 2 — m q "* 4 


dg 2 


mzcA 1 
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One gets two possible solutions with G(g) ~ e 1 ^/ 2 , but only the exponentially 
decreasing one can be used since only this one is normalizable. A similar result 
holds for F(q). This motivates the ansatz 

G(g)= (m 0 c 2 + E) l/2 e~ 6/2 ((f>i(g) + 4> 2 (q)) , 

F(g) = (m 0 c 2 -E) 1/2 e~ e/2 (<pi (g) - (hie)) , 

which we insert into (7) to give: 


( 8 ) 


(m 0 c 2 +E) e 


] / 2 r-e/2 


1 ,, , a ^ j_ d< ^ 2 

•2 W,+W+ aF + 17 


— K 


(m 0 c 2 + £ ) 1 /2 e e/2 ((j} \ + <p 2 ) 


+ 


E + niQC 2 Za 
2hcX g 


(moc 2 - £)' /2 e e/2 {<p\ - </> 2 ) 


(m 0 c 2 — E) 1/2 e _<?/2 

£ — moc 2 


1,. , ^ ,^1 d ^ 2 

_ 2 W ’ - fc)+ dF“ dF 


+ 


Za 


(m 0 c 2 + £) 1/2 Q~ e ^ 2 {4>\ + <h) 


2hc\ Q 

+ - ( m 0 c 2 - £) 1/2 q- q ! 2 {4>\ - <h) ■ 
g 

Dividing by e~ e ^ 2 and furthermore the first equation by {m^c 2 
second one by (moc 2 — £) 1//2 yields the result 

1 , . I / \ i d 0i - d</>2 

_ 2 W,+W+ dF dF 


m 

£)'/ 2 and the 


= (<pi + (f ) 2 ) + 

e 


E + m 0 c 2 Za] (m 0 c 2 -£) 1/2 

+ — I —m - 4>i) 


- jW. - « + d(? 


£ — moc 2 


2hcX 
dcf>i dh 


e J {mac 2 + £) 


1 / 2 ' 


+ 


d£> 

Za' 


2hcX 

On the other hand we have 
, 1/2 


. 1/2 


e J (moc 2 - £) 7 ^ 


( 10 ) 


(moc 2 — £) moc 2 — E 
(moc 2 + £) 


1/2 


Tic A 


and 


(moc 2 + £) ' 72 moc 2 + E 

(m 0 c 2 - E) 


1/2 


he x 


(ID 
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and therefore 
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1 ,± , - X , d 0i d0 2 

7 (01 + <P2) 4- — 1- — — 

2 Ag dp 


— (01 + 02) + 

Q 


E + moc 2 Za 
2hc\ Q 


(moc 2 - E ) 

HcX 


1/2 


-(01 - 02 ) 


1 ,, j \ I d 01 d0 2 

2 ( * - <h) + jj- - 

E — moc 2 Za 
2hcX q 


moc 2 + E 


hcX 


(01 + 02) H (01 - 02) 


Adding both equations of (12) yields 
,d0i 


01 + 2' 


d Q 


2k , , Za (woe 2 - £) 

= 0 2 + 01 H r— r (0i — 02) 

g g HcX 


Za (m 0 c 2 + E) 


(01 + 02 ) 


Q hcX 
whereas by subtracting them we get 

t d02 
dp 
2k 


- 02 + 2- 


, , Za ( itiqc 2 - E ) 

01 — 02 H r— r (01 — 02) 

£> 0 ficA 

Za (moc 2 +E) 

H i - ; (0i + 0 2 ) • 


P ftcA 
Summarizing all this yields 


dp 

d02 

dp 


k Zamoc 2 

g hcXp 


02 


k Zamoc 2 \ ZaE 
~Q + -fcX^) (t) ' + teX-p (t>2 


( 12 ) 


(13) 


(14) 


(15) 


In order to find the solutions for <+ and 02 we make the ansatz of a power 
series expansion. Separating out a factor p 7 , which describe the behaviour of the 
solution for g — > 0, we write 


01 , 02 = £ 7 X> m £" 

m=0 m=0 

Inserting this into (15) yields 
]P(m + 7 )a m p m+7_1 


(16) 




- K + 


?m+7 _ Za£ 
he A 

Zamoc 2 






E^ m+7_1 


(17) 
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^/3 m (m + 7)e m+7 1 

Zarrioc 2 


-ft + 


Y a m e‘ 


m+ 7—I 


+ 


ZaE 


hex J ' he X 

Comparing the coefficients we conclude that 




m+7 — 1 


ZaE „ A. , Zam 0 c 2 \ a 
Qmipi H - 7) — ®m — 1 x (ft "I" . J f3 m , 


he X 

„ , . , Zamoc 2 t 

f3 m (m + 7) — ( H - — - — ) + 


he X 
Za£ 


he\ J m ' hcX ^ m 

From the second equation of (18) it follows that 
Pm ~ ft + ZanriQC 2 jhcX ft — Zam^c 2 /hcX 


a m m+j — ZaE/heX 


n-m 


with 


ZaE 


n = 


~1 


hcX 

For m = 0 one gets 

Po ft — Zamoc 2 /he X ft — (n f + ~f)m§c 2 jE 

ao n r n f 

Inserting the result (19) into the first equation of (18) yields 

ft — Zamoc 2 /hcX\ 


or 


a n 


Oin 


ZaE ( Zamoc 2 

m +7 + r- + ( ft + 


he X 


he X 


n r — m 


)\ 


ZaE\ . 2 Z 2 a 2 m2c 4 

m + 7 + - J (n - m) + tc 5 — 


he X J v ' ' ” h 2 c 2 X 2 

We calculate both brackets on the lhs of (23): 


(18) 


(19) 


( 20 ) 


( 21 ) 


= OL m - 1 (22) 


= a m -i(n l -m) . (23) 


ZaE\ (ZaE \ 2 2 (ZaE\ 2 

m+7+ feA )( teX ■ 7_m ) = “ 2m7 - m " 7 + (far) ' (24) 


with 7 2 = k 2 — (Za) 2 and it follows that 


Oin 


2 (ZaE\ 
- m(2 / y + m) + (Za) 2 + ( J 

= a m -i(n'-m) , 


Zamoc 2 \ 2 
hcX J 


which can be further summarized as 

_ in' -m) _ (-1 ) m (n' - 1 )...(n' -m) 

am m( 2 ^ + m) am ~ X m\( 2 j + 1) . . . (27 + m) a ° 
(1 — n0(2 — n') . . . (m — n 1 ) 
m\( 2 ^ + 1) . . . (27 + m) a ° 


(25) 


Exercise 9.6. 


( 26 ) 
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According to (19) (3 m is found to be 


_ (k — Zampc 2 /hc\) (-\) m ( n ' - 1 ).,.(n / -m) 

n' — m m\(2y + 1 ) . . . (27 + m) a ° 


A) ■ 


Using (21) this yields 

m n'(n' - \)...(n' -m + !) 
m !(27 -f 1 ) . . . (27 + m) 

This power series turns out to be the confluent hypergeometric function 

, , a a(a + l)x 2 

F(a,c\x) = 1 + -x + . 

c c(c + 1) 2! 

We thus find that 

<t>\ = a^Ffi -n',2y+ l;p) , 

<t >2 = ftatP F(—n' , 27 + 1; q) 
k — Zamoc 2 /hc\ 


chqQ 1 F (—n' , 27 + 1; q) . 


( 27 ) 


(28) 


(29) 


(30) 


In order that the wave functions remain normalizable we must require that the 
series for pi and p 2 terminate; thus the hypergeometric functions have to be simple 
polynomials. This can only be achieved if n' is a non-negative integer, i.e. n' = 
0 , 1 , 2 ,... . 

We define a principal quantum number 

n - n' + |k| = n' +j + \ , n = l,2,3,... . (31) 

With this we can calculate the energy eigenvalue from (20) of this example and 
obtain 


ZaE 


(mfic 4 


E 2 ) 


1/2 


n + 7 


+ 7 


and consequently 


(32) 


(Za) 2 + (n -j - \ +7)" 


E 2 = m%c 4 (n - j 


i+7 y 


(33) 


E — +moc 2 
(-) 


1 + 


(Za) 2 




n — 1 , 2, 3 . . . , 

k — ±(J + j)= ±1, ±2, ±3, . . . 


-I-1/2 



(34) 


The negative sign in (34) must be excluded because, for positively charged nuclei 
(Za > 0), negative energies E do not fulfill the original equation (32), since its 
rhs is positive. We therefore write the negative sign in (34) in parenthesis, and we 
thus obtain the Sommerfeld fine-structure formula for the energy eigenvalues of 
electrons in atoms with a Coulomb potential and point nuclei. 
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Finally we want to quote the complete expression for the radial wave functions. 
Here the wave functions are normalized according to the prescription f — 

1, which explicitly implies for f(r) and g(r) that 


rOO 

/ (f 2 + g 2 )r 2 dr = 1 . (35) 

Jo 

This leads to the final expressions for the normalized radial wave functions : 8 

5(r) l _ ±(2A) 3/2 

f(r) J " H2 7 + 1) 


X 


X 


(moc 2 ± E ) r(2j + n f + 1) 


\ 4moc 2 
(2Ar) 7_1 


(n' + j)m 0 c 2 ( C n' 4- 7 )m 0 c 2 \ 

E ( E T ! 

e -,{( (,.- + ^ _ K ) F( _ n ,, 27 + 1;2Ar) 


7 nT(l -n , ,27 + l;2Ar) 



(36) 
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EXERCISE 


9.7 Discuss the Sommerfeld Fine-Structure Formula 

and the Classification of the Electron Levels in the Dirac Theory 


Solution. In the preceding problem, for the electron eigenvalues in a Coulomb 
potential we derived 


- 1/2 


E = +moc 2 

n' -n - j 
n = 1, 2 , 3, . . . 

• 1 3 5 

J — 2 ’ 2 ’ 2 ' ' ' 


1 + 


(Za) 2 


\ = n 


n ~ J 

'« I 


+ 


\/ 0 + D 


(Za) 2 


( 1 ) 


The energy eigenvalues thus only depend on the principal quantum number n, on 
|/s| and on Z. For a vanishing potential (Z = 0) the energy eigenvalue is -fmoc 2 . 
The bound electron states thus adjoin the continuum of positive energy beginning 
at -Fmoc 2 . This is plausible because, due to a “switching on of the potential”, that 
is, due to a continuous increase of the coupling strength Za from Za — 0, electron 
states from the positive energy continuum can be “pulled” into the energy gap 
between m$c 2 and — raoc 2 , thus becoming bound states. The limit of ionization of 


The integral of normalization can be derived by a lengthy but clearly stated calculation. See 
W. Greiner: Quantum Mechanics - An Introduction , 3rd ed. (Springer, Berlin, Heidelberg 
1994) Chap. 7 (Exercise 7.1). 
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an electronic atom is obviously moc 2 , and the ionization energy of an electron in 
the state rij therefore reads 


£ioniz = moc <1 


moc 2 (Za) 2 


1 + 


(2a) 2 


-i -1/2 


[n - |ft| + yf* 


J3L + <^(3_ _ 

\2n 2 2 n 3 \Jk| 4 nj) 


(Za) 2 ] z J 

2 

4 n 


( 2 ) 


which is equal to the negative binding energy, i.e. £i,i nd = -E lomz . We draw 
attention to the existence of bound states even for negative nuclear charge Za < 0 
with energies corresponding to the negative of the solutions for Za > 0. This 
seemingly paradoxical result will be explained later in connection with charge 
conjugation in Chap. 12. 

For states with j — 1/2, energy values can be calculated up to Za < 1, i.e. up 
to Z ~ 137; for j = 1/2, n — \ we have 


m$c 


1 + 


(Za) 2 
1 - Z 2 a 2 


- 1/2 


= moc 2 y J 1 — Z 2 a 2 


Za = 1 yields E — 0 or £bind = E — m$c 2 = —moc 2 . With increasing Z the 
absolute value of the binding energy also increases, as it should do. The “slope” 
dE/dZ approaches infinity for this \s \/2 state (n — 1, l — 0, j = 1/2) at Za = 1, 
while for Za > j + 1/2 the energy becomes imaginary. Thus it seems that there 
exist no bound ns j/ 2 or np\/ 2 states for point nuclei with charge greater than 
Z = 1/a = 137. This strange result can be understood in connection with the 
supercritical phenomena as a collapse of the vacuum ; a new, fundamental process. 

To calculate actual numbers, we replace moc 2 by the electron’s rest mass 
moc 2 — 0.5110041 MeV. For n' = 0 the confluent series terminates only for 
k < 0. For k > 0 they diverge even for n' = 0. It thus follows that 


f 0, 1 , 2, . . . for k < 0 
1 1,2,3,... for k > 0 


For Za <C 1 the energy formula can be expanded: 


E — moc 2 

me 2 


-(Za) 2 


1 (Za) 2 ( 1 

In 2 2n 3 \j + I 


3_ 

An 


( 3 ) 


The degeneracy of levels with equal \k\ but different /, already stated previously, 
remains unaffected. The first term in (3) represents the Bohr formula for the energy 
levels of the atom calculated according to the Schrodinger equation. Accordingly, 
relativistic corrections for the energy levels in a Coulomb field are of the order 
(Za) 2 . These corrections are only significant for small principal quantum numbers 
and in heavy nuclei. Furthermore we note that the relativistic wave funtions f(r) 
and g(r) show for |k| — 1 a weak (but quadratically integrable) divergence at the 
origin r ~ 0, in contrast to the nonrelativistic case. The states are classified in 
complete accordance to the levels of the hydrogen atom (Z — 1), as summarized 
in detail in the following Table 9.3. Table 9.4 shows the binding energy for the 
\s \/2 electron as a function of Z. The situation is illustrated in Fig. 9.13. 
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Table 9.3. The classification of bound states of the electron according to the Dirac equation Exercise 9. 7. 

for Z = 1 (hyrogen atom) 


Notation 

n 

l 

j 

f 

n 

K 

£bind / cV 

1*1/2 

1 

0 

1/2 

0 

-1 

-13.606 

2*1/2 

2 

0 

1/2 

1 

-1 

-3.402 

2 P\/2 

2 

1 

1/2 

1 

1 

-3.402 

lP3/2 

2 

1 

3/2 

0 

-2 

-3.401 

35-1/2 

3 

0 

1/2 

2 

-1 

-1.512 

3p\/i 

3 

1 

1/2 

2 

1 

-1.512 

3/>3/2 

3 

1 

3/2 

1 

-2 

-1.512 

3^3/2 

3 

2 

3/2 

1 

2 

— .512 

3d$/2 

3 

2 

5/2 

0 

-3 

— 1.512 

4*1/2 

4 

0 

1/2 

3 

-1 

— 0.850 

4pi/2 

4 

1 

1/2 

3 

1 

-0.850 

4/23/2 

4 

1 

3/2 

2 

-2 

-0.850 

4J3/2 

4 

2 

3/2 

2 

2 

-0.850 

4d 5 /2 

4 

2 

5/2 

1 

-3 

-0.850 

4/5/2 

4 

3 

5/2 

1 

3 

-0.850 

4/7/2 

4 

3 

7/2 

0 

-4 

-0.850 


Table 9.4. Table of binding energies for ls \/2 electrons as a function of Z according to the 
Sommerfeld fine-structure formula 


Z 

^bind/^y 

z 

^bind/^y 

10 

-1362 

80 

-96117 

20 

-5 472 

90 

-125 657 

30 

-12396 

100 

-161615 

40 

-22 254 

110 

-206256 

50 

-35 229 

120 

-264246 

60 

-51585 

130 

-349 368 

70 

-71699 

137 

-499 288 



Fig. 9.13. The solutions of the 
Dirac equation for an elec- 
tron in a Coulomb central 
potential: ... for point nu- 
clei (for 5 1/2 and p \/2 exist- 
ing only up to Z = 137), - 
for extended nuclei (see Ex- 
ample 9.9). The supercritical 
case Z > Z C rit ~ 172 will be 
discussed later in detail 
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9.8 Solution of the Dirac Equation for a Coulomb and a Scalar Potential 

We shall solve the Dirac equation for a mixed potential consisting of a scalar 
potential and Coulomb potential. Both cases differ in the manner of coupling into 
the Dirac equation. In the case of the Coulomb potential minimal coupling is used as 
usual, whereas the scalar potential is added to the mass term of the Dirac equation. 
Therefore it can be interpreted as an effective, position-dependent mass. In the 
same way that the Coulomb potential is derived from the exchange of massless 
photons between the nucleus and the leptons orbiting around it, the scalar potential 
of the form V 2 = —a'jr is created by the exchange of massless scalar mesons. 9 The 
cr-meson frequently quoted in the literature has a very high mass and therefore the 
corresponding potential has a very short range. Our investigations concerning the 
potential V 2 and its influence on the energy eigenvalues can therefore be regarded 
as a model study. The scalar potential can be interpreted as a Newtonian potential 
of the form V 2 — Vn = GM 0 m 0 /r = —hca'/r. 

For a mixed scalar and electrostatic potential, the Dirac equation thus reads 

[ca-p + P(moc 2 + V 2 ) ~(E -V ,)] = 0 . (1) 

With the assumption of spherical symmetric potentials, we can again derive the 
coupled radial differential equations in the usual way: 

dG Av 1 r -■) -i 

— = — G + — [E + m 0 c 2 + V 2 - V, F(r) , 

dr r he 1 J 

T- = + ~ F - T- i E - m ^ 2 Vl ] G ^ • (2) 

dr r he 1 J 

Defining 


Q OL 

V\ = — he — and V 2 — —he — 
r r 


(3) 


with a and a f being the electrostatic and the scalar coupling constants, respectively, 
we get 

E + moe 2 1 


dG At 
— — — G + 
dr r 

dF 

1F = 7 f ~ 


he 

E - moe 2 1 


-(a ~ a 1 ) 
r 


he 


+ “(Of +^') 

r 


F(r) , 

G(r) . 


(4) 


In the subsequent calculations we follow a very similar procedure to that used for 
finding the solution of the Coulomb potential, so that it is not necessary to repeat 
every single step in detail (see Exercise 9.6 and 9.7). We first consider the region 
r ~ 0, where the constant terms proportional to mass and energy can be neglected: 

9 For a more detailed discussion, see W. Greiner, J. Reinhardt: Quantum Electrodynamics , 
2nd ed. (Springer, Berlin, Heidelberg, 1994). 
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d G k (a -a 1 ) 

— = G + F(r) , 

dr r r 


d F k 

-r" = ~ F 
dr r 

The ansatz 
G - ar 7 


{a + a') 


G(r ) 


F = br 7 


yields 

ajr 7 ~ ] + Acar 7-1 — (a — ai)br 7 ~ ] = 0 
b^r 7 ~ l — nbr 7 ~ l + (a + a')ar 7 ~ 1 = 0 


Example 9.8. 


(5) 

( 6 ) 

(7) 


a( 7 + k) — b(a — a 1 ) — 0 , 

a(a + a') + b( 7 — k) = 0 . (8) 

These are two linear homogeneous equations for a and b, and the coefficient 
determinant yields 

2 2 2 , r2 

j = k — a + a 


or 


7 = 


± \/« 2 — a 2 + a' 2 


(9) 


To allow for the normalization of the wave functions, we again choose the positive 
sign for 7 and introduce the substitution 


g = 2A r with A = 


(m 0 c 2 c 4 - E 2 ) l/2 
he 


The differential equations thus have the form 


d G 
d 0 
d F 
d^ 


K „ 

—G + 
Q 


E + moc 2 (cr — a ') 


hc2\ Q 

moc 2 (a + a') 


hcl\ 


+ 


G + -F 

e 


which with the choice of 

G = (m 0 c 2 + £)' /2 e~ e/2 (4>i + 4> 2 ) , 

F = (moc 2 - E) 1/2 e-e'Hfa - fa) , 

leads to: 

- + $ 2 ) + ^ + ^ = --(h + fa) 

2 dp dp g 


+ 


E + moc 2 (a — a') 
2hc\ o 


moc 


hcX 


-{fa — fa) 


( 10 ) 


(ID 


( 12 ) 
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2 d g d q 

E - m 0 c 2 (a 4- a') 


mr\C 2 + E , , K 

7 — r (0 1 + (f) 2) H (01 — 02 ) 

hC A Q 


2 hc\ Q 

Adding and subtracting the equations yields 

2d0i 2 k (a-a')moc 2 - E 

~ 0 i + — = <h + <P 1 + r — ; — (0 1 - 02) 

dp £> p he A 


(a + a') moc 2 4- £ 
p /icA 


(</*i + 02) 


and 


2d02 2 k (a - a') m 0 c 2 - E ^ 

-02 + — = 01 - 02 + ^ (01 - 02 ) 

d Q Q Q rlC X 

(a + a') m 0 c 2 + E 

H r— : — (0i + 02) , 

£ hCA 

respectively. We now collect terms and obtain: 
d0i ; 

d e 


aE a'moc 2 \ 

1 - — — — 1 01 


k amoc 2 a'E \ 
g ^ hcXp ^ hcXp ) 2 


hcXp HcXp 
d02 _ f k amoc 2 a'E \ ( aE a'moc 2 \ 

d g \ g hcXp hcXp ) 1 \/k:Ap + hcXp ) 2 

For 0 i and 02 we make the power series ansatz: 


( 13 ) 


0i = amQm ’ 02 = Q 1 XI f3mgm 

m — 0 m — 0 

and insert them into the differential equations, giving 
+ -f)a m g m+1 ~ l 




aE m+7-1 a'moc 2 

hcX 


k + 


hcX 

amoc 2 ^ 


E 


a m Q 


m+ 7 — 1 


/icA /icA y 

'E.Pmim +'y)g m+ ' r ~ l 

amoc 2 a'E 




-K + 


+ 


he A /icA 

aE a'moc 2 


^ ' &mQ 
m+7-1 


m+7— 1 




hcX hcX 
A comparison of coefficients yields 

, . ( aE a'm 0 c 2 \ 

“” (m+7) = Q - 1 -(teA + -&rJ 


O'* 


ft + 


amoc 2 ^ a'2? 


( 14 ) 


( 15 ) 


Pn. 


„ , , , amoc 2 t 

0m (^ + 7) — — ft H r — : h t — r ) a m + 

he x he x 


he x ’ he x 1 m 

aE a'moe 2 


hcX ^ hcX 


(3n 


( 16 ) 
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and from the second equation we get 

P m —n + am^c 2 jhcX + a'E jhcX 
a m m + 7 — aE jhcX — a'moc 2 /hcX 
k — amoc 2 /hcX — a'E/hcX 
n' — m 


with 


(17) 


n = 


aE a'moc 2 


+ 


~7 


he x he x 

As in the case of a pure Coulomb potential, this is the equation for determining 
the energy eigenvalues. Only for n' = 0, 1,2, . . . do the resulting confluent hyper- 
geometric functions for the wave functions degenerate to polynomials, and we get 
standing waves. 

Defining 

n — n 1 -\rj-\r\ — 1,2,3,... , 

we get 

aE + a'moc 2 


1 


(wqC 4 — E 2 ) 


which leads to 


1/2 


n-j - ^+7 


(18) 


a 2 + (n — j - \ + 7) 2 E 2 + 2aa'moc 2 E 
= m%c A (n-j - \ + 7) 2 - a ,2 mlc A 


or 


£ 2 + 


laa'rriQC 1 


a 2 + (n -j -\+l Y 


2 4 


+ rriQC 


a 


/2 


- (n ~j - 2 + 7 )' 


a a + (n — j 


1+7)' 

Finally we obtain the energy eigenvalue 


-0 


(19) 


E = moc z 


-aa 


a 


il 


a 2 + (n-j -\ + 7 ) 


1/2 


aa 


a* 


+ ( n 


+ 7/ 


(n -j - I + 7 y 


a 2 + ( n - j - | + 7 Y 
Let us now consider several special cases: 
( 1 ) 


( 20 ) 


a = 0 


7 


yj k 2 + ot 1 


/ 2 


£ = ±ra 0 c * / l - 


or 


/2 


(» - J -5 + 7 )' 


Example 9.8. 
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Obviously there exist two branches of solutions in the bound region: the solutions 
for positive and negative energies exhibit identical behaviour, which reflects the 
fact that the scalar interaction does not distinguish between positive and negative 
charges. The order of the levels is striking: 1 s x/2 , 2 p i/2 , 2s X / 2 and 2 p 1/2 , 3 s\/ 2 , 
and so on. For a' — > oo, E approaches the value E = 0 (see Fig. 9.9 in Exercise 
9.2). 

States with negative energies correspond to antiparticles. The particle and an- 
tiparticle states approach each other with increasing coupling constant, without 
touching. Critical behaviour, as in the case of the Coulomb potential, does not oc- 
cur; the Dirac vacuum remains stable in the case of scalar potentials (see Exercise 
9.2). 

( 2 ) 


a = 0 , 

E = m 0 c 2 


7 = y/ k 2 — a 2 


1 + 


or 


n V2 


(« -j 


1+7)' J 


We have here to choose the positive square root, because in the case of very weak 
fields the electron states dive from the upper continuum into the region of bound 
states. Furthermore the negative square root yields a contradiction to (18). The 
obtained result is, of course, identical with Sommerfeld’s fine-structure formula 
(see Exercise 9.6 and 9.7). 

(3) a = a'. 

This yields 7 = |/c| and n - j - j + j = n. 


E — moc 2 


—a 


a 2 -I- n 2 


± 


or 


(a 2 + n 2 ) 2 


a 2 — n 2 
a 2 + n 2 


me 


moc 


± 


-a" 
a 2 4- n 2 

—a 2 n 2 

a 2 + n 2 a 2 + n 2 


I a 4 - a 4 + n - 
( a 2 + n 2 ) 2 


The negative sign would seem to yield a solution E 2 = -moc 2 , and the large and 
the small component in the potential term of the Hamiltonian decouple. However, 
this is an invalid solution, because E = ~m 0 c 2 contradicts (18). For the positive 
sign, it follows that 



2a 2 ) 

a 2 + n 2 J 


For a — > 00, E approaches the value —moc 2 asymptotically, but the state never 
dives into the negative energy continuum. For n — 1 and a — 1 the energy becomes 
zero. 
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EXAMPLE 


9.9 Stationary Continuum States of a Dirac Particle in a Coulomb Field 

Again we start with the coupled radial differential equations for F and G: 

F , 


^ = -^G + 

dr r 


E + moc 2 Za 

ij + — 

he r 


d F_ 
dr 


K 


-F + 


E — moc 2 Za 

+ — 

he r 


( 1 ) 


and perform the substitution x — 2i pr, where 


[• E 2 - (me 2 ) 2 ] 

he 


2 \ 21 l / 2 


— i\ 


( 2 ) 


With dx/dr — 2i p followed by the division by 2i p, one gets 

F , 


d G k _ 

— — — G + 

dx x 


d F k 

— — + — F 
dx x 


E + moc 2 Za 

|_ 2i phe x 

E — moc 2 Za 

[ 2i phe x 


(3) 


For F and G we make the following ansatz: 
(a) for positive energies E > m§c 2 \ 

G = \JE + m o c 2 (0i + <j> 2 ) , 

F = i \JE - m o c 2 (0i - 0 2 ) , 

and (b) for negative energies E < - moc 2 : 

G - y/-E - m 0 c 2 (4>i + 4> 2 ) , 

F = —i\/—E + m o c 2 (0 1 - 0 2 ) • 


(4) 


(5) 


Inserting (4) into (3) and dividing by \/E + moc 2 and i \JE — moc 2 , respectively, 
yields 


d0i d <f> 2 

dx dx 


d0i 

dx 


— (0i + <p 2 ) + 

x 

d<fe 

dx 


E 4- moc 2 Za 
[ 2iphc x 


iy/E 


moc z 


\ y/E + 


ni()C l 


(4> 1 - <fo) 


K 


-(4>i - <t> 2 ) 


’ - moc 2 Za 
2\phc x 


\/E + m 0 c 2 

Ji\/£ 


moc z 


(01 + 02 ) 


( 6 ) 


Anal ogously, ins erting equation (5) into (3) and dividing by \J -E - m 0 c 2 and 
-i yJ-E 4- moc 2 , respectively, yields 
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d(j> i d02 

dx dx 


— + (j ) 2 ) + 

X 


E + moc 2 Za 


[ lip He 


+ 


(~i )y/-E + mpc 2 
\/-E - moc 2 


(01 — 02) 


d0i d02 
dx dx 


= “(01 “ 02) - 
X 


E — moc 2 Za 
x 


lip he 


\yf—E 


moc 2 


V-E + 


(01 + 02 ) 


moc 2 


(7) 


Let us now consider the factors on the rhs of (6): 
\E + moc 2 Za 


lip he x 


i(E — moc 2 ) i(hc) 2 p 2 Za i E iZam^c 2 

hep lip 2 (hc) 2 x hep xhep 


E — moc 2 Za 

lip he x 


(E+moc 2 ) — ( hc) 2 p 2 Za E Zam^c 2 


i hep li 2 p 2 (hc) 2 x ihep i hep 

Adding the equations in (6) and division by 2 yields 

d0i k , 1 i ZaE , iZamoc 2 , 

— - = 02 + -01 + — 01 H z 02 ■ 

dx x 1 hepx xhep 


• ( 8 ) 


(9a) 


Similarly, subtracting the equations (6) from each other and dividing by 2 we get 

lartiQC 2 iZaE 

-r 9 1 - T 92 , (9b) 

jl z, acpx acpx 

or together: 


d</>2 K 1 

— — 0j - -02 

dx x 1 


dh 

dx 


1 i ZaE\ 




k ( iZamoc 2 
x 


xhep 


d02 

dx 


K 

X 


iZamoc 2 


01 + 


hepx 

The factors on the rhs of (7) yield 


1 i ZaE\ 

1 hepx ) 2 


E + moc 2 Za] E 2 - m 2 c* _ j Za(-i)hcp 

— - -|- | ■— - — -j- j— — 


[ lip he 


-E — moc 2 


( E + moc 2 ) x 


( 10 ) 


E - rriQC 2 Za] \[ El ~ m o cA _ 1 
2i phe x J i (-£ + m 0 c 2 ) 


Zahcp 
2 ^ x(— i) (E — moc 2 ) 


( 11 ) 


Addition of the equations in (7) followed by division by 2 leads to 
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#1 _ « , , 1 , 

~T~ ~ — — <P 2 + X01 


dx 


+ 


1 

+ 2 


2 

i Zahcp 


+ 


Zahcp 


( E + moc 2 ) x x( — i) ( E — moc 2 ) 

—Za(—i)hcp Zahcp 


4> i 


+ 


— (£ + moc 2 ) x x(-i) (£ — moc 2 ) 


02 


— 0 2 + X01 


1 

+ 2 


1 

+ 2 


K 


Zahcp ( E — moc 2 ) + Zahcp ( E + moc 2 ) 


(-i)x (£ 2 


m?c 4 


o l 


) 


4>i 


-Zahcp ( E — moc 2 ) + Zahcp (£ + moc 2 ) 
— ix [E 1 — rnfic 4 ) 


02 


1 


\ZaE 


— 02 + X01 4 Z 01 + 

x 2 x hep 


iZaniQC 7 

xhep 


02 


and analogously, the difference of Eqs. (7) yields 

d<p2 k 1 iZamoc 2 iZa£ 
d7 = ”x 01 " 2 02 " —hcpT^ ~ ~h 


( 12 ) 


(13) 


Comparison of (12) and (13) with the system (10) shows that both systems of 
differential equations are identical, and that for positive and negative energies we 
have to solve the same system of differential equations. 

Now we consider the complex conjugate of (10) (jc is purely imaginary): 

i ZaE\ ^ ( k \Zamoc 2> 


d 0[ 

dx 

d02 

dx 


1 


iZamoc 2 \ . (\ 


0i + 


hepx J ' ri ' \2 
Equations (14) are identical to (10) if we set 

01 = 02 or 02 = 0* • 


xhep 
i ZaE\ 
hepx J 


02 

02 


(14) 


(15) 


This is the necessary condition for G and F to be real- valued functions and thus 
to describe standing waves at infinity. We now eliminate 02 from the system of 
(10) to give: 

-l 


. k \Z arrive 2 

02 = + z 

x xhep 

hepx 


d0i 

dx 


def)] ( 1 i ZaE 

—Khcp + iZamoc 2 dx \2 hepx 


1 i ZaE\ ( ( k \ZamQC 2 

+ T 101 — + ° 

x hep 

01 


2 ' hepx ) \ x 

hepx 


-l 


-nhjcp 4- iZamoC 2 


(16) 


hepx 


d0i 


iZaraoc 2 — nhep dx 
1 ftcpx 


01 - T7 


\ZolE 


2 iZamoc 2 — nhep iZaraoc 2 — Kftc/? 


-01 


Example 9.9. 
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and calculate its derivative 


#2 

dx 


hep 


dfa 


iZamoc 2 — nhep dx 
1 hep 


+ 


hepx 


d 2 fa 


iZamoc 2 — nhep dx 2 


1 


fa-^1 


hepx 


dfa 


2 iZartiQC 2 — nhep 
i ZaE d(f>\ 

iZamoc 2 — nhep dx 

( n hep + iZamoc 2 
hepx 

( hepx ( dcj>\ 


2 iZ attire 2 — nhep dx 
[see (13)] 

i ZaE\ 


~ 1 i + hepx ) 


\ iZamoc 2 — nhep \ dx 


1 \ _ i ZaE 

2 r J iZamoc 2 — nhep 


fa 


• (17) 


This yields 

hepx 


iZamoc 2 


+ 7 


d 2 (/>i hep — i ZaE d<j>\ 


nhep dx 2 iZamoc 2 — nhep dx 
i ZaE dcj>\ 1 hep 


iZamoc 2 — nhep dx 2 iZamoc 2 
i ZaE 


i K i 

— — <p\ + <p\ 

nhep X 


iZamoc 2 
H r fa 


-<t> l 


hepx ' ri 2 ( iZamoc 2 — nhep ) 

iZa£ (ZaE) 2 


-<f> l + 


2 (iZamoc 2 — nhep ) hepx (iZamoc 2 — nhep ) 
1 hepx 


fa 


4 iZamoc 2 — nhep 
To summarize: 


fa =0 


d 2 4>\ 1 d(pi 1 n (iZamoc 2 - nhep ) 

dx 2 x dx 2x^ 1 + xhcpx ^ 

iZamoc 2 (iZamoc 2 — nhep) i ZaE 

+ ^ ~ fa ~ fa 


(hepx) 2 
i ZaE (ZaE) 2 

2hcpx 1 (hepx) 2 1 


2 hepx 


1 


fa = 0 


(18) 


(19) 


fir 


</>i ?fa 


2 ^ x 2 


d 2 0i + 1 d^i 1 ^ l ZamoC 2 n 

dx 2 x dx 2x^ xhcpx 

1 (iZamoc 2 ) 2 iZamoc 2 nhcp 

4 1 (hepx) 2 (hepx) 2 

iZaE ± , (ZaE) 2 ± _ n 
~ + ~ 0 ’ 


<£i 


and finally 

d 2 <?S>i 1 dfa 

dx 2 x dx 


r^i 


2x 


* - 4 * + 


iZa£ 

hepx 


fa =0 


( 20 ) 


( 21 ) 
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Rewriting (21) leads to 


d 2 </>, 


7 1 d<t> i 

dx 2 x dx 


1 


4 V 2 


1 i ZaE\ 1 


hep J 


7 


I i 


fi=0 


( 22 ) 


where 7 2 = k 2 — ( Za ) 2 . It is sufficient to know </i(x); then f 2 (x) results from (15). 
To solve the differential equation (22) we substitute 


W=X 1 / 2 0, , 

and a second-order differential equation for 0] results: 
d 

dx 


(23) 


x - U2 dW _ 1 - 3 / 2 w 

1 

+ “ 

1/2 dW _ 1 3/2 ' 

dx 2 

X 

dx 2 J 


— + I — + 
4 V 2 


1 i ZaE\ 1 


hep J 


7 


1 7 


x~ 1/2 W = 0 


Therefore, 


- 3 /»«: +x . 1/1 ^ + 3 _ 5/2h ,_i 
2 dx dx 2 4 2 dx 

+ x -3/2dW__ 1 -5/2^ 
dx 2 


1 


4 + \2 + 


1 i ZaE\ 1 


hep ) 


- + 




x - i/2 W =0 


so that the final result reads 
d 2 W 


dx 2 


1 

4 + 


1 + {ZaE \ I + 7 2 ~ 1/4 


2 hep J x 


W(x) = 0 


(24) 


The regular solution (at x = 0) of the second-order differential equation, in which 
there appears now no first order derivative, is the Whittaker function 10 


VK_ ( i >+ i/2), 7 (x) =x 7+1/2 e x/1 F ( 7 + 1 +ry,27+ l;x) 


with 

7 


ZaE 

hep 


(25) 


(26) 


Thus, [and with (23) and (2)], <f>\ becomes 

4>\ = N(y + iy)e lT, (2pr) 1 e~ lpr F('y+ 1 +iy,27+ l;2ipr) 
= N(y + xy) {Ipf 1 f(r) , 


(27) 


where we have introduced the phase p, which has to be adjusted to make <^ 2 = 4>* 
valid. Now we rewrite the first equation of the system (10): With x = 2i pr and 
d/d r = 2i p d/dx then 

10 See, e.g. M. Abramowitz, I. A. Stegun: Handbook of Mathematical Functions (Dover, New 
York 1965). 


Example 9.9. 



244 


9. Dirac Particles in External Fields: Examples and Problems 


Example 9.9. 


hcpr 


d0i / i ZaE\ , ( k iZamoC 2 

ir = r + ^7j* + (-7 + 

Inserting 4>\ of (27) into (28) yields 

,#(>0 


<t>2 


N^ + ^e^iipy- 


d r 


+ 


i ZaE \ 

ip + N ^y + iy) e>v (. 2 P)' 1( P( r ) 

r + XZ °hcpr ) ^ (7 ~~ 1 y)e~^(2/?) 7 </>*(r) 


Solving the equation with respect to e ,TI produces 


2i?7 _ 7 ±jy 


e ' = 


7 - 1 y 

7 + i y 


k iZamnc 2 
-+ 

r ncpr 


J_d0 
</>* dx 

(i/? + iZ a£ / hcpr) <f> 


7 — iy (— «/r + iZ amoc 2 / hcpr) 4>* 


i + jy 
i — iy 


—rhcp 


1 d</> 


—nhcp + iZamoC 2 <^>* dr 


+ 


ihcp 2 r + iZaE <j> 


Also 


—nhcp + iZamoc 2 <^>* 
ikcp 2 r + iZa£ hcp(ipr + i ZaE /hep) 


-nhep + \ZamoC 2 —hcp(n — iZamoc 2 /hep) 
_ — ip(r + ZaE)/hcp 2 
k — iZartioC 2 /hcp 
_ —ipr(\ + ZaE /hcp 2 r) 


k — iZamoc 2 /hcp 


holds. Thus it follows that 
e — 2ir, = 7 + i7 r 


Id 4> . A y \ 4> 

7 — iy k — \ym^c 1 lE [<j>* dr ^ \ pr ) <$>* 


and in accordance with (27) we have 


(28) 


(29) 


(30) 


(31) 


(32) 


<K r ) = r 7 e ipr F (7 + 1 + iy ,27 + l;2ipr) . (33) 

Hence we may apply the following relations for the confluent hypergeometric 
function 

d a 

—F{a,c,x) = —F(a + l,c + \\x) 
dx c 

= a C F (a, c + l;x) + F(a,c\x) , (34) 

e~ x/2 F( 7 + 1 + iy ,27 + l;x) = t x/2 F{ 7 — iy , 27 + 1 ; -x) , 
xF{a + l,c + l;x) = c[F(a, + l,c;x) - F(z,c,x )] 


(35) 

(36) 
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and obtain 

(j>* = r 7 e lpr F( 7 + 1 — iy , 27 + 1; — 2i pr) 

— r ~i Q- l P r p(^ + iy , 27 + 1 ; 2i pr) 

and 

^ = r 1 &~' pr ^-F{ 7 + 1 + iy,27 + l;2ipr) 
dr dr 

+ F(7 + 1 + ry,27 + 1; 2i pr)e~ ipr (7 r 7-1 - i/?r 7 ) 
7 + 1 + iy - 27 - 1 , 


= r 7 e~ ,pr 


2ipF(-y + 1 + iy, 2y + 2; 2ipr) 


27+I 

+ 2ipF(y + 1 + iy , 27 + 1; 2ipr) 

J 

+ F(7 + 1 + iy, 2y + l;2i pr)e~ ipr (7 r 7_1 - ipr 7 ) 

7 + iy 


27+1 


r 7 2ip e~ ,pr F( 7 + 1 + iy , 2y + 2; 2ipr) 


+ F(7 + 1 + iy , 27 + 1 ; 2ipr) (i/rr 7 e >pr + 7 r 7 1 e lpr ) 
e ‘'' r (27 + 1) 


_LiJZ r 7-i p- i r r r 
27+1 


x [F(7 + 1 + iy, 27 + 1; 2i pr) - Fij + iy , 27 + 1; 2i pr)\ 
+ F( 7 + 1 + iy , 27 + 1; 2i pr) [i pr 1 e~' pr + 7 r 7_1 e _,pr ] 
= iyr 7-1 e~ ,pr F( 7 + 1 + iy, 27 + 1 ; 2i/?r) 

- (-7 + iy)r 7-1 e -lpr F(7 + iy,27+ 1 ; 2i/?r) 

+ i pr 1 c~ ipr F( 7 + 1 + iy , 27 + 1; 2ipr) 


= e 


ipr r 7 1 [(iy + ipr)F(y + 1 + iy , 2y + 1 ; 2i pr) 


- (-7 + iy)F(l + iy , 27 + 1; 2i pr)] 

Now we state that 

r [d^/dr - ip(l + y /pr )<?!>] 


7-1 y 


4>* 


= 1 


or 


# - A ’A 

r- \pr<p — 1 y<p 

dr 


= <£*(7 - 17) 


(37) 


(38) 


(39) 


(40) 


which can indeed easily be proved: 

(iy + i/>r)F(7 + 1 + iy , 2y + 1; 2ipr) - (-7 + iy)F(7 + iy , 27 + 1; 2ipr) 

- (i pr + iy)F(7 + 1 + iy, 27 + 1; 2i pr) 

= (7 - iy)^(7 + i7) 2 7 + l;2ipr) , qed. (41) 


From the statement (39), the phase (32) follows as 
k — iymoc 2 /E 


e 2ir > = 


7 + iy 


(42) 
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For the radial functions of equations (4) and (5) we may write 

G = C 1 N(2/?r) 7 {(7 + iy)e _lpr+1,7 F( 7 + 1 + iy, 27 + 1; lipr) + c.c.} , 

F — 1 C 2 N (2pry {(■y + \y)t~' pr+ ' T, F{'y + 1 + iy ,27 + \\2ipr) — c.c.} , (43) 


taking (27) and (15) into account, (by c.c. we mean here the complex conjugate) 
where 


C, - 


\JE + m 0 c 2 
. \J ~E - m 0 e 2 


for E > moc 2 
for E < —moc 2 


(44) 


C 2 = 




moc *■ 


-V-E + 


moc 1 


for E > moc 2 
for E < —moc 2 


(45) 


Let us now determine the normalization factor N. We normalize the continuum 
wave functions with respect to the energy axis, i.e. to delta functions of energy: 


J = 8(E' - E) . 


(46) 


Furthermore, we assume that for r — > 00 
G = AC\ cos (pr + 8) , 

F = -AC 2 sin (pr + 8) (47) 


holds. We will discuss the proof of (47) later on. According to (4) and (15), for </>, 
this means that 


4>\ = \ A ex P i' ( P r + *5)] 

Now we show that for 
1 


A = 


V^p 


(48) 


(49) 


the normalization condition (46) is fulfilled. Indeed, for E > m 0 c 2 it follows that 
[ dr l \/E + m 0 c 2 -2— [e i(pr+5) + e -^ r+fi) l 

Jo l 2 V*P 


x \JE' + moc 2 


1 


-y/E^ 


moC‘ 


2JW 

1 r 


Q i(p'r+S') _|_ e -i (p'r+S 1 ) 


X \JE’ — rtlQC 2 


2 y/Wp 

1 


(pr+6) _ -i 


i(pr+<5)j 


2y/7 vp 7 


e i (p'r+6') _ e ~i (p'r+6 f ) 


} 
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| POO f J 2 

[giCP+p'^+tf+tf' _|_ e i(p-//)r+<5-<5' e i(p'-/?)r+<5'-<5 e -i(p+//)r-£-$' 


— \JE — moc 2 yjE / — moc 2 - 


47T (pp f ) 1 / 2 

x lyfr+p'Jr+fl+a' _ e i (p-p')r+6-6' _ (p'-p)r-6+6' _j_ Q -i(p +p' )r- 6- 6' 

= ^ [\/£ + m 0 c 2 \/ E' + mo c 2 ^ ^) )1/2 [«5(p - p') + 8(p - p')\ 

[8{p-p') + 8{p-p')} 

-6{p -p') 


+ \JE - m 0 c 2 \/E' — moc 2 
(£ + moc 2 ) 


1 1 


-8{p-p') + 


2 (pp 1 ) 1 / 2 

(E - moc 2 ) 


£ E8(E-E') _E6(E-E') 

~ p diP P) p dp/dE p E/p 


1 

52, 

II 

(50) 

where we have set 


p = \Je 2 — niQC 2 . 

(51) 

The derivation of relation (50) is only valid 

in a strict sense if the phase 8 does 

not depend on r. Moreover, the following relations for the <5 function were used: 

1 r°° ■ 

— / e lkx dk — 8(x) , 

2 7T J_ 00 

(52) 

6(-x) = 8(x) , 

(53) 

f(x)6(x - a) = f(a)8(x - a) , 

(54) 

*>«] = ^ 

, (55) 


8(x) = 0 for x >0. The result (50) is also valid for E < -moc 2 . In order to 
describe the asymptotics of the functions (43), we apply the asymptotic behaviour 
of the confluent hypergeometric functions for r — > oo: 


F(a,c-,r)^^f\r“- c e r . 

r(a) 

Thereby from (43) we obtain for r — ► oo that 
G = QN(2pryr(2'Y + 1) 

F = iC 2 N(2p r yr(2-f + 1) 


(56) 


('y + iv)e i/jr+l,? 

w + v/ e ( 2i pr yy-i + c.c. 

A 7 + 1 + iy) 

"(7 + i y)e'P r+iT > 


A 7 + 1 + iy) 


(2i pr)' y 7 — c.c. 


(57) 
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Now we perform some transformations: 
i~ 7 = e z =4> z = —7 ln(i) . 

For z = x + iy = r e' e , lnz = In r + i 0 =» In i = and it follows that 

i -7 = e -« 7/2 (5g) 

Analogously one gets 

(2i pr) iy = e z =>■ z = iy ln(2i pr) — iy ( In i + ln(2 pr)) 

= iyif + i}’ ln(2 pr) , 


i.e. 


(2 iprf = e _7r - v/2 e' vlna/,r) 


( 59 ) 


Furthermore, with .T(z + 1 ) = Z-T(z) one calculates 
7 + iy _ 1 _ 1 

F( 7 + 1 + iy) _ r(7 + iy) _ |r(y + iy)|e* 

e -iarg T(7+iy) 

|A 7 + iy)| 


( 60 ) 


With that the radial functions from ( 57 ) have the asymptotic form ( 47 ). In doing 
so, one gets 


1 _ 2 N e~*y/ 2 r ( 27 + 1 ) 

y/np |r(7 + iy )| 


( 61 ) 


The normalization constant N is fixed by this equation. For a phase 6 we obtain 

ivy 

8 =y ln(2pr) - arg T(y + iy) - -y- + V ■ ( 62 ) 

This is the Coulomb phase, which is already known to us from the nonrelativistic 
problem. Thus, the final results for the wave function are 

C\( 2 pr ) 7 e ,ry / 2 |i~'(7 + iy)| 

K 2(’Kp) 1 / 2 r(2'y + 1 ) 

x {e~ lpr+tT, (-y + iy)F(y + 1 + iy,2y + l;2ipr) + c.c.} , 

F iC 2 (2prye«y/ 2 \r(y + iy)\ 

K 2(itp) x / 2 r(2'y + 1) 

x {e~ tpr+,v (y + iy)F(y + 1 + iy,2y + l;2i pr) - c.c.} . ( 63 ) 

We see that the calculation of the continuum wave function is a difficult task. Be- 
sides the partial waves we have discussed here, one can also construct travelling 
waves. 11 Let us merely remark that for the calculation of all scattering processes 
with heavier nuclei, and also for the calculation of quantum electrodynamic pro- 
cesses, the here-determined continuum waves represent the necessary and appro- 
priate technicals tools. For the study of “new” quantum electrodynamic processes 
in the strong fields of heavy ion collisions, the continuum waves have even to be 
calculated for extended nuclei. 


11 For a detailed discussion of this calculation, see M.E. Rose: Relativistic Electron Theory 
(Wiley, New York, London) 
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EXAMPLE 


9.10 Muonic Atoms 


A muon is a particle which has the same properties as the electron in almost every 
attribute, except that it is about 207 times heavier than the electron, i.e. 


= 207 m e 

(m e c 2 = 0.51 1004 MeV, m^c 2 = 105.655 MeV). 

Since it also has spin \ and negative charge (the antiparticle of the is the 
/t + , which has positive charge), it also obeys the Dirac equation. Some effects, 
which play only a minor role for the electron in usual atoms, become important for 
the fi~ when it “circles” around the nucleus forming a muonic atom. These effects 
are linked to the large mass of the muon, which implies that the Bohr radius of the 
jjT is smaller than that of the electron by a factor of 


m e _ 1 

207 


( 1 ) 


Thereby all the effects which are connected with the extension of the nucleus 
(modified Coulomb potential at smaller distances, quadrupole interaction, etc.) are 
more important in the case of the muon than for the electron. Some of these effects 
we will briefly explain in the following. Before that, though, a brief abstract of the 
formation and the most important physical processes involved in the creation of 
muonic atoms seems appropriate. 

Production of Muonic Atoms. Muons are created by performing inelastic scat- 
tering experiments of high-energy protons with protons (hydrogen). There, pions 
are produced according to the reactions: 


p + p 


{ 


P + p + 7T + 7T + . .. 
p + n + 7T + + . . . 


( 2 ) 


The pions are separated from the beam with magnetic fields. They decay into 
muons within about 10 -8 s due to the weak interaction: 


7 r -> /x + . (3) 

The n~ produced in this way are now decelerated; then, the slow [i~~ are bombarded 
onto ordinary electronic atoms, e.g. Pb. Through the interaction of the n~ with the 
electrons, some electrons will be knocked out of the atom, the will be further 
decelerated and finally captured by the nucleus. The “capture orbit” of a muon is 
one of the outer orbits with principal quantum number n w 14. Such a muonic 
atom (with one muon and still many electrons) is then in a highly excited state. 
In the first step, the de-excitation takes place via transitions within the outer shells 
in which the muon transfers energy to further electrons, which are consequently 
emitted (Auger emission). The transitions between inner muonic atom shells result 
in 7 emission, which can be observed and measured (see Fig. 9.14). 

We now discuss a few remarkable differences between muonic atoms and elec- 
tronic atoms. 
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Fig, 9.14. Illustration of the 
inner a orbits around an 
atomic nucleus. The influ- 
ence of the electrons, which 
are positioned far out, is 
schematically indicated by a 
cloud 



(a) Recoil Effects, Since, in comparison to the muon, the nucleus does not have 
infinitely high mass, both particles move around a common centre of mass, which 
is not in the centre of the nucleus. For the Schrodinger equation the centre-of-mass 
motion can, as in classical mechanics, easily be separated. Then the relative motion 
is simply described by a one-particle equation with the reduced mass p 

AV = ; n • ( 4 ) 

nucleus 

Here m ^ is the mass of the muon. In the case of the Dirac equation, this separation 
does not work, because: 

(1) A satisfying two-body Dirac equation 12 does not exist. 

(2) The centre-of-mass system can no longer be defined geometrically (as in clas- 
sical mechanics), but only dynamically; namely as a Lorentz system, in which the 
sum of the momenta of all the involved particles vanishes, i.e. Y^=\ Pu = 0. In 
an approximation one can manage the problem by assigning the above-mentioned 
reduced mass p n to the muon in the Dirac equation. The binding energies of the 
muons are considerably influenced by the thus (approximately) considered recoil 
effect. In view of the experimental precision of AE jE < 10 _4 -10 -5 , there appears 
no difficulty in verifying these effects. 

(b) Retardation Effects. If one considers the interaction between the nucleus and 
the muon in the form of a static Coulomb potential in the Dirac equation, one 
thereby performs an external field approximation. The interaction within this ap- 
proximation takes place without time delay; the effects of retardation which are due 
to the recoil motion are neglected. If the velocity of the muon in the inner shells 
of heavy nuclei becomes comparable to the velocity of light c, the retardational 
effects should become large and then also the previously mentioned “reduced mass 
approximation” turns out to be insufficient. In this case it is necessary to consider 

12 A relativistic many -body mechanics has been proposed by F. Rohrlich [see Annals of 
Physics 117, 292 (1979)]. It remains, though, to be seen if this theory 7 can be quantized 
in a satisfactory way. We refer also to the Bethe-Salpeter equation, which is discussed in 
W. Greiner, J. Reinhardt: Quantum Electrodynamics , 2nd ed. (Springer, Berlin, Heidelberg 
1994). 
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a two-body equation derived from the quantized theory (field theory), the so-called 
Bethe-Salpeter equation. Since these effects are just near to the edge of the maxi- 
mal precision of present experiments, we will not discuss them in this context any 
further. 

(c) Screening by Electrons. Though the Bohr radius of the many electrons in the 
atom is much larger than the one of the muon, the tails of the electrons’ wave 
functions reach into the inner region (see Fig. 9.15). 



Fig. 9.15. Qualitative picture of the charge distribution of a nucleus g p (r) with half-value 
radius c and surface thickness t. The charge distributions of the muon q m and the electrons 
Q e are multiplied by r 2 . It can be recognized that the muon stays partly in the interior of the 
nucleus and the electrons stay partly within the muonic orbit. Note the logarithmic r scale! 


Thereby the electrons cause a screening potential for the muons which is not 
negligible. For symmetric charge distributions g e (r) it can easily be calculated that 

^screening ( 7" ) = 4ne 2 j 1 J Q e {r')r a dr' + Qe(r')r ' dr' | . (5) 

Here 

N 

&( r ) = ( 6 ) 

1=1 

is the charge density of the N electrons still bound within the atom, ^ being their 
wave function. For more precise calculations one even has to determine the wave 
function of the muon and those of the electrons self-consistently. As a typical case 
for the screening of the muon by the electrons, we consider the 5 g - 4/ transition 
in muonic lead, whose transition energy amounts to AE ~ 400 keV. Due to the 
electron screening, the single muon levels will the less bound by an energy of about 
17keV. Calculating the transition energy, the effect of screening then reduces to a 
few eV, because both levels are shifted in the same direction (to weaker binding). 

(d) Vacuum Polarization, Self- energy and Anomalous Magnetic Moment. In Chap. 12 
we will learn about field-theoretical effects, 13 which cause small, but measurable, 
deviations from the Coulomb interaction of the electrons or muons with the nu- 
cleus. One has to consider the effect of vacuum polarization, the self-energy and the 
anomalous magnetic moment. The vacuum polarization describes the creation of 
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13 In particular this is discussed (with more precise calculations) in W. Greiner, J. Reinhardt: 
Quantum Electrodynamics , 2nd ed. (Springer, Berlin, Heidelberg 1994). 


252 


9. Dirac Particles in External Fields: Examples and Problems 



Fig. 9.16. Illustration of the 
vacuum polarization as a 
“dipole cloud” in the electric 
field of the nucleus 


virtual electron-positron pairs in strong electromagnetic fields. The virtual electron- 
positron pairs form a cloud of dipoles, leading to a modification of the Coulomb 
potential (see Fig. 9.16). 

The properties of the vacuum are similar to a dielectric medium. The dominant 
part of the vacuum polarization potential is the so-called Uehling potential given 
by 

4 a \ r°° 

^vp(f) = — £ J [Zi(k — r '\ ) — Z](r + r , )J dr , (7) 

where the structure function is 

z, w = /^ xp {-| r e}(i + ^)«lzi^l de , m 

with X e = 386. 1592 fm being the Compton wavelength of the electron. For r > X e , 
i.e. in the region where mainly atomic electrons are present, the vacuum polarization 
potential falls of exponentially. Therefore very heavy particles, like the muons (and 
pions), whose probability is large in the region Nucleus < r < X e , are particularly 
suitable for examining the vacuum polarization potential. Nowadays the precision 
of experiments with muonic atoms is so accurate that processes of higher order 
than those in (7) and (8) also have to be considered. The self-energy describes the 
interaction of one particle with itself by emission and reabsorbtion of a photon (see 
Fig. 9.17). There is a similar cause for the so-called anomalous magnetic moment, 
whose contribution to the energy is illustrated by the graph in Fig. 9.17. Here we 
only note that the self-energy and the anomalous magnetic moment are of the order 
1/m. Therefore they are more important in the case of electrons than for muons. 
In view of the present experimental accuracy the latter only play a minor role. 


Fig. 9.17. (a) Self-energy dia- 
gram, (b) anomalous mag- 
netic moment 



(a) 



(e) Nuclear Deformation. Until now we have assumed that nuclei have a spherical 
symmetric charge distribution, which may be well described by means of a two- 
parameter charge distribution with half-density radius c and surface thickness t as 
parameters. The most common one is the so-called Fermi distribution, which is 
given by 


6 = 


Qo 

1 + e< r-c >/ f 


( 9 ) 


The half -density radius c and the surface thickness t are free parameters, which 
are determined from fits to experimental cross-sections of fast electrons. Another 
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method for their determination is the following: When measuring the transition Example 9.10. 

energies of various 7 transitions in a muonic atom, accurate values for c and t 

can be determined by means of so-called c — t diagrams. In other words: The 

transition energies of the muonic atom depend on the parameters c and t of the 

nuclear charge distribution (9). However, many nuclei show deviations from the 

spherical shape. For this reason one has to expand the nuclear charge distribution 

into multipole moments: 

Q(r) = Qo (r) + ^ Qim(r)Yi m (6, (p) . (10) 

lm 

In most cases nuclei are axially symmetric, but deformed in a cigar-like manner, 
i.e. they exhibit a prolate deformation. Then (10) reduces to 

g(r) = Qo(r) + Q 2 (r)Y 20 + . . • , (11) 

where Q 2 — £> 20 - The second term describes the quadrupole deformation. Accord- 
ingly, the quadrupole moment of the nucleus is defined as 


Qo 


-s 


p(r)r 2 T 2 o dV = 2\ 



/ 47T 

"5 


g 2 (r)r 4 dr 


( 12 ) 


Not only the absolute values, but also signs of these quadrupole moments 
can be determined from the pronounced hyperfine structure of the muonic atoms. 
Hyperfine structure means observable level shifts, additional to the l • s splitting 
(fine structure). 

In order to reproduce transition energies, e.g., in a muonic uranium atom one 
introduces even four-parametric charge distributions, like 


////// 


g(r,c,t,f3, 7 ) = go 


1 + exp 


-c(i + /?r 20 ) 

. 3 Hl+faYm) 


-1 


(13) 

-5 


In comparison to (9) the half-value radius and the surface thickness are now 
treated as angular-dependent quantities, implying that the density of the “atmo- 
sphere” at the tip of the “nuclear cigar” is different from that on its short axis. 

A typical spectrum of a heavy muonic atom is sketched for uranium in Fig. 9.18. _ 10 
Furthermore it illustrates the difference between the solutions for point-like and 
extended nuclei, and the different effects discussed above are indicated as far as 
their order of magnitude allow. 


“ 3 ^ 3/2 


2 Pm 

2 P\ri 


1 * 1/2 


(a) 


- 3<Z 3 /2 


2 pvi 

2 P\a 


(b) 


(f) Nuclear Polarization. Until now the nucleus has been assumed to be static. 
This implies that the nucleus will not be influenced by the presence of the muon. 
However, the nucleus behaves like a dielectric medium and thus it is polarizable. 
It can be excited by the muon and its charge distribution can be slightly changed. 
We write the total Hamiltonian 

H = H mc \ em (v ) 4* H meson (v ) 4- Ant (^4 v ) , (14) 

where Ant = ~Ze 2 /\r — r ' | describes the electromagnetic interaction between 
nucleus and meson. One performs a multipole expansion 


Fig. 9.18a, b. Binding ener- 
gies of muons in a uranium 
atom, (a) The more realis- 
tic spectrum with extended 
nucleus. The hyperfine split- 
ting due to the quadrupole 
moments of the uranium nu- 
cleus is indicated for the \s 
level, (b) The same spectrum 
for a point-like U nucleus. 
The 1^1/2 state lies at —27.35 
MeV, i.e. outside the diagram 
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2 1 


l,m 


4-Tre 
21 + 1 


^y /m (W ; ;a4eso„) 


(15) 


and treats H ml in higher-order perturbation theory. 14 

Let us note a typical value for the contribution of nuclear polarization effects. 
In muonic Pb the transition from 2p ] / 2 to 1 s\f 2 has a transition energy of AE ~ 
lOMeV, whereas the contribution due to nuclear polarization AE np is of the order 
of lOkeV. 


(g) Isomeric Shift. The ground and excited states of a nucleus do not need to have 
equal charge radii. If a nucleus has been excited, then usually the average quadratic 
nuclear radius changes by Ar 2 . Such an isomeric shift gives rise to a level shift in 
muonic atoms and can be deduced (measured) from studying the level structure. 
For the first excited nuclear state one finds typical changes of the radius of the 
order of 

—r-jX ~ 10 -4 to 2 X 10 -3 . 

v) 

Such an excited state may be formed during the cascade of a muon within the 
atom. The mesonic transition energy is not always emitted by radiation of a real 
photon, but can also be absorbed directly in the nucleus. Accordingly, the nucleus 
will be excited into higher states. During the subsequent transitions the meson feels 
a different charge radius. 15 

(h) Polarization of Muons and Pions. In our previous considerations the muon 
and pion have been considered as structureless particles, disregarding their internal 
degrees of freedom. However, one knows that the pion has a charge radius of 
fpion = 0.8 fm. Moreover, considering that a pion consists of two quarks, it becomes 
obvious that also the pion can be polarized in the presence of the strong field of 
a nucleus. However, the corresponding energy shifts turn out to be just too small 
for an experimental verification at the present time. Concerning the muon, which 
belongs to the lepton family, an “extension” (a substructure) is still unknown, and 
up to now leptons (electrons, muons, tauons) also seem to be point-like particles. 


exercise 

9.11 Dirac Equation for the Interaction Between a Nuclear and an 

External Field Taking Account of the Anomalous Magnetic Moment 

The Dirac equation for a nucleon interacting with an external electromagnetic 
field contains an additional term describing the interaction between the anomalous 
magnetic moment of the nucleon with the electromagnetic field: 

14 We refer to the literature. See, e.g., J.M. Eisenberg, W. Greiner: Nuclear Theory, Vol.II: 
Excitation Mechanisms of the Nucleus, 3rd ed. (North-Holland, Amsterdam 1988). 

15 See: J.M. Eisenberg, W. Greiner: Nuclear Theory, Vol. I: Nuclear Models, 3rd ed. (North- 
Holland, Amsterdam 1987) p. 73. 
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^4 + K j -^-a tlv F t11 ' - Mjc 2 ^ ip(x) = 0 , 

where j = p and e p = \e\ for the proton and j = n and e n = 0 for the neutron, 
respectively. 


Problem, (a) Show that the additional term does not violate the relativistic invari- 
ance of the Dirac equation. 

(b) Show that this term satisfies the hermiticity condition 


eh A 


70 ^ 70 = % 


t 


eh A 


AM;c 


Remembering that when turning from the matrices a, (3 to the gamma matrices 
in the Dirac equation, a term 70 has been factorized out; see (3.8). 

(c) Show that the choice K p — 1.79 and K n = —1.91 corresponds to the 
experimentally observed magnetic moment. 

Solution, (a) Relativistic invariance of the term a liy F^ u as a contraction of two 
Lorentz tensors (covariants) is obviously manifest. 

(b) Since a^F^" represents a product of an operator a pu with a real tensor 
= ( F n»y [see (U5)] t i. e . 






with the field strength tensor F ^ v , it is sufficient to show that 


t ^ - 


'YoVjjLv'yo 


' fiy 


holds. For this purpose we use the relation 7 ^ = 70 and 7 * = - 7 , and in shorthand 
notation we have 7 ^ = loi^io- We explicitly derive 


lo&lvlo = 7o 


^ (inlv ~ Ivin) | 7o 

= ~^7o {llll ~ lilt) 7o 

= -^7o {lalvlllnlo ~ lolnlolvlo) 7o 

= ~ 2 (.Ivin ~ 'Ivflv) = & , 

where 7 q = B has been used. 

(c) First we rewrite the interaction term: 

= \lnlvF^ - ^IvlnF^ 


= ^InluF^ ~ -jl^F' 1 '* 1 = Hnl^F^ , 
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where we have renamed the dummy indices, and the relation = -F u ^ has 
been used. By considering that 

F'=F 0 ' and B = (F 2 \F 3 \F 12 ) 

and 7*7/ = iXy, with 



a f 0 

0 &j 


as well as 7*70 = a*, the sum reads explicitly: 


= E^* 7 "* + E"*^’ - 7 o 7 .f“ - + -l» 1 oF°° . 

j <k j <k 


In view of the relation F 00 = 0 , 7,7* = -7*7, and = -F kj , the sum becomes 

= ^(E^ - 'E-tnoF 10 ) . 


We obtain 

7,7 2 F 12 = iZ^ , 7273 F 23 = iEiBi 
737 i ^ 3I = 1S2B2 = 7173F 13 , 


and thus 


^ ljlkF jk = iZ • S and 7y7o^° = -a • E , 
j<k j 

which yields 

53^ F/M/ = 2i{il7-B + 6.£7} . 

M," 


Instead of the original Dirac equation we obtain 


(Wy — ^4 - M jC 2 ) t/w = K i ^ • B - i« • w*) • (i) 

The Ihs exclusively describes the coupling with the electromagnetic field. We know 
that this part of the equation leads to a nonvanishing magnetic moment if e ; - ^ 0 
(i.e. only for proton), namely 


'‘' =MB = iV ““ , ‘" =0 ' 

In the nonrelativistic limit the rhs of ( 1 ) takes the form 



eh 

2 M~c 


\& • B - i-<r • E 

l c 



since 
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E = 


& 0 
0 a 


and a ~ -a 
c 


We can neglect the second term, because in the nonrelativistic case v/c =<^C 1 
it is small compared with the first term, which just corresponds to the interaction of 
the magnetic moment with the magnetic field. Thus, we obtain the total magnetic 
moment 


/^p — (l + Kp ) 


eh 

2M p c 


for the proton, i.e. = 2.79 /in, with K v = 1.79 and 
eh 

** D = Kn m ^ = ~ 191 ^ 

for the neutron. 


EXAMPLE 


9.12 The Impossibility of Additional Solutions 

for the Dirac-Coulomb Problem Beyond Z = 118 


Motivation: For an electron coupled to a point particle with charge Z we can 
rewrite the Dirac equation into the two coupled equations 

, kG f E + nioc 2 Za 

G -\ 7 1 


F' 


r 

kF 

r 


+ 


he 

- mpe 2 
he 


+ 


r 

Za 

r 


0 


0 


( 1 ) 


To study the asymptotic behaviour of the solutions for r — > 0 we have to take into 
account only the derivative terms and the terms containing a factor 1 /r, whereas 
we can neglect the terms proportional to m and E. The resulting equations can be 
solved with the ansatz F(r) = F^r 1 , G(r ) = G^r 1 , yielding 


(k + 7 )G 0 - ZaF 0 = 0 , 
ZaGo — (k — 7 )Fo = 0 . 

( 2 ) 

This leads to the condition 


Fo Za k + 7 

(3) 

Go k — 7 Za ’ 

which requires 


7 = ± \/ k 2 — (Za) 2 . 

(4) 


One of the two independent solutions is regular at the origin, while the other 
diverges like r - l 7 L If the charge Z is small, this divergence is so strong that 
the wave function cannot be normalized and thus is to be rejected as a physical 
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solution. However, for a sufficiently weak divergence the normalization integral 
/ d 3 r(F 2 + G 2 ) is finite. This happens in the case 7 > - 5 . which means that 

Za > \J k 2 — For $1 and pi states (k = ±1) the charge has to exceed Z > 

\J\ x 137 ~ 118. Under this condition it might appear that both types of solution 
of the Dirac equation are admissible since they both are normalizable. This would 
have drastic consequences because the rejection of the irregular solution is essential 
for obtaining discrete energy eigenvalues. If both solutions were acceptable then 
there would exist a continuum of solutions for the following reason. For any given 
energy one can find a solution which is well behaved for r — > 00 . For r — > 0, on the 
other hand, this solution contains admixtures behaving like r + l 7 l and r~^ as well. 
For | 7 | < \ this cannot be accepted since the solution is no longer normalizable. 
Only for certain values of the energy does the component r - ^ vanish. These are 
the energy eigenvalues. For |-y| > on the other hand, any linear combination of 
r ±l 7 l leads to a solution normalizable at r — > 0. For any given solution one only 
has to make sure that the wave function is normalizable also at r — > oo, which can 
be done for every energy. 

Are these solutions physically relevant? A static point charge represents a very 
singular object which we actually do not find in nature, since any particle of 
finite mass cannot be localized at a single point. Furthermore, only nuclei (in fact 

superheavy nuclei ) 16 have charges of the order Za and these are clearly 

extended objects. Even if such a singular point source of such a high charge does 
not exist, it is still interesting to study the problem of an extended source in the 
limit that its extension gets smaller and smaller, either by shrinking the nuclear 
radius artificially or by studying a hypothetical elementary particle of that charge 
in the limit that its mass goes to infinity and the particle is localized more and 
more. The following exercise gives some insight into the problem. 


Exercise: Study k = — 1 states of Dirac electrons in the potential of a hollow 
sphere of charge Za > and radius R and show that in the limit R — > 0 only 

the solution behaving like r + V ,- ( z “> 2 is recovered . 17 


Solution: We proceed in the usually way. We solve the wave function for the 
inside and outside regions separately and then try to match the ratio F /G. In the 
outside region the differential equation contains the ordinary Coulomb potential. 
In the inside of a charged spherical shell the potential takes on the constant value 
Za/R, so that the differential equation becomes (k = -1): 



E +m 0 c 2 Za\ 

~ fi^ + -R- |F= ° 
E - m 0 c 2 Za\ 

~1^ + ~r )g 


( 5 ) 


16 See J.M. Eisenberg, W. Greiner: Nuclear Theory , Vol.III: Microscopic theory of the nu- 
cleus, 3rd ed. (North-Holland, Amsterdam 1990). 

17 This exercise has been worked out by W. Grabiak. 
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This is just the free Dirac equation for a particle with energy e — E + hcZa/R. 
The general solution is given by 

F — k cos (kr + 8) — ^ sin(A;r + 8) 

G = ( sin(kr + 8) ( 6 ) 


with lick = s/e 2 — (moc 2 ) 2 . Taking 8 = 0 this is the solution found in (33), 
Exercise 9.5. The validity of the generalization to 8 ^ 0 is easily checked by 
insertion into the differential equation. However, even without performing this 
calculation explicitly it is clear that the constant phase shift 8 can have no influence 
on the neutral cancellation of the sine or cosine terms. 

In contrast to the Coulomb problem with Z > 1 18 the constant potential admits 
only one normalizable solution, namely <5 — 0. Thus we get 


F 

G 


r=R , inside 


he 

moc 2 + e 


R 


— k cot (kR) 


(7) 


Now we are interested in the limit R — > 0. In this limit m and E can be neglected 
against Za/R, so that we can set e/hc — k = Za/R, and (7) becomes 


77 — ► if - + cot(Za) . ( 8 ) 

G Za 

Now we have to study the behaviour of the wave function in the outside region. For 
a given energy the wave function will in general have admixtures behaving both 
like l and for r — > 0 , but the latter will dominate its asymptotic behaviour. 
In this limit we can also neglect the mass as well as the energy. According to (3) 
the F /R ratio then reads 


F_ 1 + H = l + y/1- (Za) 2 

G r ^ R Za. Za 


(9) 


in the limit R — > 0. Comparing (7) and (9), we see that the matching condition 
becomes 


1 + y/l- (Za) 2 

ZOi 


= cot(Za) 

Za 


( 10 ) 


in this limit. It is impossible to solve this equation, since for a < 1 the lhs is 
greater than 1 /Za, whereas the rhs is smaller. We conclude that we cannot find a 
solution in the limit R — > 0, i.e. if the potential approaches the potential of a point 
charge, if the component r ^ dominates the behaviour of the wave function for 
small r. Only in the special-case limit where this component vanishes does (9) no 
longer hold and is it possible to get a solution. But this happens only if the energy 
approaches an energy eigenvalue. Thus one retrieves the usual energy eigenvalues 
of the Dirac equation, whereas no new solutions are found. 

The same result is found for a homogeneously charged sphere. The potential in 
the inside region is too small to produce an F /G which could fulfil the matching 
condition. We can convince ourselves of this in the following way: we can ask 
which value of the potential is needed on the inside in order to fulfil the matching 
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condition, assuming that there is a constant potential V 0 on the inside which is not 
equal to Za/R. The ratio F /G in the outside region is at most 1, and matching is 
only possible if the potential in the inside is larger than 2.04 /R (the solution for 
F/G = l). 18 

The fact that one can find solutions if the potential in the inside region is large 
enough is not surprising. Even without the 1 /r part on the outside one can get 
arbitrarily deeply bound states if only the strength of the potential is big enough. 
On the other hand, if one decreases the potential in the inside region, the energy 
of these solutions increases until they are finally turned into ordinary solutions of 
the \/r potential. Thus no new type of solution is found for the limiting case of a 
well-behaved potential approaching 1 /r variation. 


This solution also determines the energy eigenvalues for the MIT bag - see W. Greiner, 
B. Muller: Gauge Theory of Weak Interactions, 2nd ed. (Springer, Berlin, Heidelberg 1996) 
and W. Greiner, S. Schramm, E. Stein: Quantum Chromodynamics, 2nd ed. (Springer 
Berlin, Heidelberg 2000). 6 ’ 



10. The Two-Centre Dirac Equation 


The description of one or more electrons in the field of two nuclei is one of 
the fundamental problems in quantum mechanics. Among other things it contains 
the theoretical understanding of the phenomenon of chemical binding and thus 
is connects physics and chemistry. In 1927 W. Heitler and F. London 1 showed 
for the first time the possibility of the existence of the H 2 molecule. They used an 
approximation procedure and were able to calculate its physical properties (binding 
energy, binding length). 

Soon afterwards E. Teller 2 (1930) and E.A. Hylleraas 3 (1931) gave, indepen- 
dently of each other, methods for a mathematically exact solution of the nonrela- 
tivistic single-electron two-centre problem. Later, G. Jaffe 4 (1934) came up with a 
third solution. But the numerical evaluation of these procedures is very costly, so 
that the energies of the higher states can be calculated only with computers. Good 
nonrelativistic calculations in recent times - also including some deviations from 
the Coulomb potential - came from K. Helfrich and H. Hartmann 5 (1968). 

Calculations for many-electron systems, similar to the Hartree-Fock model, 
only became available in the early 1970s, e.g. those by F.P. Larkins 6 (1972). Here 
one uses a finite set of basis functions (one-, two-, or even three-centre functions), 
which are combined with the intention of finding the lowest total energy. In view 
of these difficulties, and the fact that experiments up to now were possible only 
in a small energy range (some eV up to a few keV), an extensive study of the 
corresponding relativistic equations did not take place. Solely S.K. Luke et al. 7 
have treated the relativistic corrections of the single-electron problem in first-order 
perturbation theory in Za. However, especially from the experimental point of 
view, one urgently needs a solution of the two-centre Coulomb problem which is 
exact in all orders of Za. Heavy-ion accelerators allow highly charged nuclei to 
approach for a short time so closely that even the innermost electron shells belong 
to both nuclei and form molecular orbitals (cf. Fig. 10.1). 

Now we consider collisions of heavy ions with kinetic energies near the 
Coulomb barrier, so that both nuclei can just touch. An overlap of the involved 
nuclei and the related inelastic processes shall be excluded here. If both nuclei are 
very heavy, it is possible to generate for a short period (r ~ 10“ 17 s) superheavy 

1 W. Heitler, F. London: Z. Phys. 44 , 455 (1927). 

2 E. Teller: Z. Phys. 61 , 458 (1930). 

3 E.A. Hylleraas: Z. Phys. 71 , 739 (1931). 

4 G. Jaffe: Z. Phys. 87, 535 (1934). 

5 K. Helfrich, H. Hartmann: Theor. Chim. Acta 10 , 406 (1968). 

6 F. Larkins: J. Phys. B5, 571 (1972). 

7 S.K. Luke et al.: J. Chem. Phys. 50, 1644 (1969). 
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Fig. 10.1. Quasimolecular orbitals. The projectile ion moves along a curved trajectory. The 
individual atomic orbitals reappear when the colliding ions are separated enough ( upper 

right). During the collision electrons (so-called 6 electrons - marked by t e_ ) and X rays 
(~*) are emitted. The emission is caused by the varying molecular orbitals, which adjust 
themselves with respect to the varied intemuclear separation 


quasimolecules with total charge Z = Z\ + Z 2 , which are far above the end of 
the periodic system: 107 < Z\ + Zi < 190. For these systems the velocity of the 
heavy ions is about 1/10 the velocity of light, while the “velocity” of the electrons 
in the inner most states is near the velocity of light. This becomes obvious if we 
remember that for Zcv ~ 1 the binding energy of the 1 s electrons becomes compa- 
rable with their rest mass. This means that the electrons of the inner shells during a 
heavy-ion collision have time enough to adjust with respect to the varying distance 
R between the two Coulomb centres. Thus the properties of the quasimolecular 
electronic orbitals are determined by the sum of the projectile charge Zp and the 
target charge Z- 

If the distance R between both colliding ions becomes less than the radius of 
the K shell, we call the system a superheavy quasiatom. For a Pb-Pb collision 
this situation arises for R < 500 fin. Induced by the varying Coulomb field during 
the collision, the possibility is given that inner-shell ionization occurs by excita- 
tion of electrons into vacant higher bound states or by direct excitation into the 
continuum. The highly energetic parts of these final-state continuum electrons are 
called “delta rays” or “delta electrons”, a term originating from when these ra- 
dioactive rays were discovered: At that time, one bombarded different targets with 
protons and discovered a highly energetic component of radiation which could not 
be explained by the classical laws of collisions. Therefore it was assumed that 
beside the know alpha, beta, and gamma rays, a new kind of radiation had been 
discovered; hence “delta rays”. Nowadays it is well known that the high-energy 
component of the momentum distribution of the bound electrons allows very much 
higher energy transfers that can be expected in accordance with the classical laws of 
collisions. Clearly, the mentioned momentum distribution of the quasimolecularly 
bound electrons depends on the distance between the colliding ions. 

These facts suggest the use of the measurement of delta electrons for the spec- 
troscopy of electronic states in these superheavy quasimolecules. Further possibili- 
ties are given by the measurement of the decay of the electron-hole pair which has 
been created during the collision. Indeed, during the collision the so-called quasi- 
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molecular 7 radiation will be emitted or - as long as the hole is filled a long time 
after the collision - the ordinary characteristic radiation of the single atoms can be 
measured. Some of these processes are illustrated in Fig. 10.1. Such spectroscopy 
allows the extension of the periodic system by a factor two with respect to the 
atomic number Z, at least as far as interior electronic shell structure is concerned. 
Furthermore, the search for the vacuum decay in supercritical fields (see, e.g. Ex- 
ercises 9. 5-9. 7 and Chap. 12) necessitates the precise knowledge of the electronic 
structure in the course of a heavy-ion collision. 8 Fundamental to all such inves- 
tigations is the solution of the Dirac equation for two charged centres. Therefore 
the energies and the wave functions of the electrons need to be determined for the 
given distance R between the nuclei. 

In 1973, the two-centre Dirac equation was solved for the first time by Muller 
and Greiner. 9 The solution was given in terms of prolate elliptic (spheroidal) co- 
ordinates £, 77, (f) with 

* =/ V^ 2-1 ) 0 -??2 ) cos<p > 

y = R \j (£ 2 - !) 0 - V 2 ) siny> , 

z=R£r} (10.1) 

defined in the domain 

1 < £ , -I<r?<+1 , 0<(p<2n . 

The problem is rotationally symmetric round the z axis; thus the component J z 
of angular momentum is a good quantum number and the eigenfunction in the 
(f variable is easily separated. It is given by exp(im^). However, the resulting 
differential equations in the variables £ and 77 do not decouple. The differential 
equations were diagonalized in the so-called Hylleraas basis, which is essentially 
composed of Laguerre polynomials in £ and Legendre polynomials in 77. 

Meanwhile two centre Hartree-Fock solutions were also obtained, by Fricke et 
al. 10 (1975). Here, however, we restrict ourselves to the study of a very successful 
procedure by Muller et al. for the solution of the two-centre Dirac equation, which 
is essentially based on a multipole expansion of the two-centre potential. 

For that purpose let us consider again the general form of the Dirac equation 
in polar coordinates. Using the relation 

a x (6 x c) = b(a • c) - c(a • b ) 

we obtain 

-e r x (e r x V) = —e r (e r • V) + V (e r • e r ) , (10.2) 

where e r denotes the unit vector in the r direction. Using this, the V operator may 
be written as 

8 See W. Greiner, J. Reinhardt: Quantum Electrodynamics , 2nd ed. (Springer, Berlin, Hei- 
delberg 1994) and especially W. Greiner, B. Muller, J. Rafelski: Quantum Electrodynamics 
of Strong Fields (Springer, Berlin, Heidelberg 1985). 

9 B. Muller, W. Greiner: Phys. Lett. B47, 5 (1973); Z. Naturf. 31a, 1 (1976). 

10 B. Fricke, K. Rashid, P. Bertoncini, A.C. Wahl: Phys. Rev. Lett. 34, 243 (1975). 
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V = e r ( e r • V) - e r x (e r x V) 
d i / e r f \ 

~ Cr dr h\r XL ) ’ 


(10.3) 


where L = —ih(r x V) is the orbital angular momentum operator. Hence it follows 
for the operator of the kinetic energy 


d 1 

ca-p= -i hca r - cd • (e, x L) , 

Or r 

with a r = d • e r . Now we use 

(d • A) (a • B) = A • B + \E • (A x £) = (E . A)(£ • £) 

and insert A = e r and B = L, which gives 
(d • e r ) (at • L) = e r • L + \E • ( e r x L) 

= (E-e r )(E-L) = E r (E-L) 

because 

e r • L = 0 , 


(10.4) 


(10.5) 


( 10 . 6 ) 


since e r is orthogonal to L = r x p. We multiply by -75 from the left, where 

, (10.7) 


7 5 


0 -n 
-b 0 


and remember that 

E = - 7 'd = -d 7 5 , a = - 7 'i; = -£7' . 

Hence it results that 
id • (e r x L) = a r (E • L) , 
and for the operator of kinetic energy 

. _ 1 (7 a, a a 

coc • p = —\hcoi r — — b ic — (E - L) 
or r 

Now it is convenient to introduce the operator K, defined as 
K = 0 (E.L + h) , 

and one obtains the stationary Dirac equation in polar coordinates: 


ic 7 5 E r ( h ^ K + ^ ) + V(r) + $m 0 c 2 


^(r) 


Let us analyze the properties of the spin-orbit operator K in more detail, 
show that it commutes with / 3 : 

r <*. A-1 A. A A A. /V A 

[*,/?] = 0 (E'L)P-E-L = O , 


( 10 . 8 ) 

(10.9) 

( 10 . 10 ) 

( 10 . 11 ) 

( 10 . 12 ) 
First we 

(10.13) 


because 
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& 0\ fl 0 

0 <ry ( 0 -11 

11 0\ f& 0 

0 -10 & 


<x 0 

0 -<x 

& 0 

0 -<x 


and 


Furthermore we have 

0(& -p) - (d -p)P = 2j3(a -p) , 

and also 

[0{E- L), a* p]_ = 0(E-L) (a-p) - (a-p)$(E-L) 

= p(E-L) (a-p)+0(&.p) (E-L) 

= 0[Z-L,6fp\+ • 

Now 

[E • L, a- p] + = -75 [L'p + p'L + iE'(Lxp + px L)] 
L • p — (rxp) • p = 0 , 

P-L = 0 , 

and we calculate further that 


L x p = 


e x Gy G z 

Lx Ly L z 


I Px Py Pi 

= e* {LyPz - L z Py) - e y ( L x p z - L z p x ) + e z ( L x p y - L y p x ) 


pxL = 


C y e-z 

Px Py Pz 

L x Ly L z 


(10.14) 


(10.15) 


(10.16) 


(pyL z p- Ly ) e y (p x L z p z L x ) + b z {p x Ly p y L x ) . ( 10 . 17 ) 

Summing, we obtain 

A A 

L x p + p x L 

~ (Lypz ~~ Pz Ly — L z py + py L z ) — Gy (L x p z — p z L x — L z p x + p x L z ^j 

+ e z (L x py — PyL X — Lypx + p X Ly'j ( 10 . 18 ) 

Now we look at the first commutator of the e x component: 

[4>PzL = ( r x Ph'Pz ~ Pz( r x P)y 

= -Xp z p z + Zpxpz + Pz x Pz -PzZpx 

= zp x pz-p z zpx • ( 10 . 19 ) 

By use of 

[xi,pk}_ = \h6 lk , 
it follows that 

I Ly,p z ] _ = Zpxpz + i ftpx ~ Zpzpx = i hp x • 
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In an analogous way the second commutator of the e x component is obtained: 
- [Lz,Py\_ = - {(r X p) z py -p y (r x p) z } 

= ~ x PyPy + yPxPy +PyXPy -p y yp x 
= yPxPy - p y yPx = ypxPy + i hp x - yp y p x = i hp x . 

In vector notation we can generally sum up and get 

Lxp+pxL= 2i kp (10.20) 

Finally, for the commutator of K with the kinetic energy we obtain 
[£, ca • p] _ = 2hcP(a • p) + • ( 2ihcp ) 

= hc(2fc&-p) + 2fa' 5 (i:-p)) 

— hc(2/3(a • p) - 2(3(oc • p)) = 0 , (10.21) 

i.e. K commutes with the Hamiltonian for free particles. The commutator of K and 
j can be obtained in a similar way, namely 

[P(Z-L),L + \h£]_= p[E'L,L]_ + \hP[I!'L,Z;]_ . (10.22) 

First we investigate the second commutator, obtaining 
[E • L , E]_ = (E • L)E - E(E • L ) 

= ( E X L X + EyLy + E Z L Z ) (£^5 ^yi^z) 

— (£*> A- 5 E z ) ( E X L X -j- EyLy E Z L Z ) 

Now we use the assertion 

[-57 • L , 17] _ = 2i(-57 x L) , (10.23) 

verifying this for the x component. (The proof for the other components proceeds 
analogously). Thus 

( EyLy + E Z L Z )E X — E X (EyLy + E z L z ^j 
— i E z Ly -j- i EyL z iE z Ly + \EyL z . 

On the other hand 

i[ExL] x =iE y L z -iE z L y (10.24) 

holds, where we have used the following relations for the Z’-matrices: 

E y E X — 5 = 

Hence the relation (10.23) stated above results, and, furthermore, we suspect the 
validity of 

[E • L , L]_ - ~ih(E x L) . (10.25) 

By use of the commutation relations of angular momentum 
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LyL x LxLy — ifiLiz j L Z E X — i HLy , 

the x component of the commutator is found to satisfy 

2JyLyL x ' X/ z L z L x — L x E y L y — L X E Z L Z — — i HE y L z -f- iH E z L y 
The x component of the vector product yields 
—ifr(E x L) x = — i K(£ y E z - E z L y ) , 
so that in total we have 

(3[Z-L,L]_ = -ihj3(E x L) , 
and finally we obtain 


K,j 


0 


(10.26) 


(10.27) 


For K 2 it follows that 

A a A A A A a A A A A A a 

K 2 = (E • L + hf = L 2 + iS • (L x L) + 2hS • L + /i 2 

A A A A a 

= .L 2 + HE • L + h 2 . 


(10.28) 


Now we denote the Dirac equation for the Coulomb potential of two centres of 
charge, Z\e and Zie , separated by the distance R, as 


Z,£ 2 


Z 2 e 2 


ln ' 5 ^ r \ h d r + r r^J \r-R/2\ \r + R/2\ 
= E^ir) . 


4- j3mc,c 2 


Mr) 


(10.29) 


The magnetic quantum number /i is the eigenvalue of the projection of the total 
angular momentum onto the axis connecting the two nuclei (z axis). The nuclei 
are assumed to be point-like, and at this stage a multipole expansion of the wave 
function suggests itself: 


W r,-E^)-E(£fc 

K K- V 


(10.30) 


where f K (r) and g K (r) are the radial wave functions; k was defined by 


K 


/ for j = / — 5 


-/ — 1 for j = l + - 
and the spinor spherical harmonics are given by 
Xk , l i = - m > m ) 

m=± j 

Now we have the unity spinors Xm 


(10.31) 


(10.32) 


V 0 


and X- i = 
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Next we apply various operators to the two spinors and get 



& Xm = > 

J = hj(j ~b l)X/C,/i 5 

5 2 X.,, = ft 2 * fa + l)X«,/x = , 

L 2 Xk,h = $1(1 4- l)x«,/z > 

2 L.S X ^ = (J*-P-&)x^ 

= $ [j(j + 1) - /(/ + 1) - S(s + 1)] x«,Ai • (10.34) 

With her • L = 2 S • 1/ , it follows that 

(<y • Z, + ft) X«,/x = ft[/(/ + 1) - /(/ + 1) - s(s + 1) + l]x«,/x • (10.35) 

This relation can be evaluated in both cases of (10.31): 
a): 

l = j + \ , = / = 7 + ^ 

and b): 

l —j — \ — 1 = ~ j + 5 ~ 1 

= ~ 0 + 3 ) = “M ’ 

with the result 
a): 

j(j + 1) ~ Id + 1) ~ *(* 4- 1) + 1 = (/ — 5 ) (/ + 5 ) — /(/ + 1) — | + 1 

= l 2 — \ — l 1 — 1 + | = — / = — ft , 

and b): 

7(/ + 1) - I'd' + 1) - j(* + 1) + 1 = (/' + £) (/' + §) - /'(/' + 1) + \ 

— 2 V ~ l' - 1- 1 ^ = Z ; + 1 = —k, , 


(10.36) 
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so that generally 
(a • L + h)x K ,v = 


(10.37) 


holds. We now move on to investigate the multipole decomposition of the two- 
centre potential. It is generally known that 


1 1 

|r — r'\ ^/r 2 + r a — 2 rr' cos 7 

where 7 is the angle between r and r' and 


1 

v — r' 
1 

r + r' 


r >\J l ~ 2 r t C0S ^ + {^) 
1 

r > J l + 2 77 cos 7+ (77) 


(10.38) 


with 


r< = min(|r|, |r'|) , r> = max(|r|, |r'|) . 

Expansion of the root yields 

1 00 1 

uTT^TT = 7TT P ' (C0S ^ 

|f 1 1=0 r > 

and correspondingly 


1 00 l 

17^7 7 i = E(- 1 ) / ^r p '( cos ^ ’ 

^ ' 1 /=o r > 

with the Legendre polynomials 
Po(x) = \ , 

P\(x)=X , 

P 2 (x) = l(3x 2 - 1) , 

etc. 

In general, the recursion relation 

(/ + 1)P /+1 (jc) - (21 + 1) x P,(x) + lPi-\(x) = 0 

holds. For point-like nuclei the two-centre Coulomb potential reads 

Z,e 2 Z 2 e 2 

V ~ ~\r-R/2\ ~ \r + R/2\ ’ 


(10.39) 


(10.40) 


(10.41) 


which, after expansion into multipoles, yields 
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OO i 

V(r, R, 7 ) = -Z,e 2 g -V_--P l(cos 7 ) 

OO l 

- ^ (R/ 2V+1 P/(C0S 7) forr< § > 

V(r,/?, 7 ) = -Z^ 2 f; ^^P ; (cos 7 ) 

/=0 

°°^ (R/2) 1 

-■Z2« 2 ^(-iy — f+^-Pi (cos 7) for r > | . (10.42) 

/=o r 

The monopole part for point nuclei of the two-centre Coulomb potential can 
easily be derived from the multipole expansions, and is given by the l = 0 term 
of (10.42), i.e. 


Vo(r) 


(Zi +Z2)e 2 R 

-2 for r < — 


(Z, + Z 1 )e 7 


f ^ R 

for r > — 
~ 2 


(10.43) 


Remark on multipole expansions of potentials: We write generally 

00 00 

V(r,R) = Y V,(r,R)P,(cos0) = Y Vi(r,R)P,(x ) , with 
1=0 1=0 

x = cos d , 

and use the following normalization condition: 


(10.44) 


/: 


PKx)P,(x)dx = — — 

— l Zl i 


Sr 


VI 


(10.45) 


Here d is the polar angle of r — {r sin i9 cos ip, r sin d sin <p, r cos 1?}. Multiplying 
by Pi(x) and integrating over x from —1 to +1 yields 


/_ 


+1 2 
V(r,R)P l (x)dx = V,(r,R)—— 
1 Zl -j- 1 


or 


V,(r,R) = 


21 + 1 f 1 

— 2~ J V(r,R)P,(x)dx 


(10.46) 


Now we will show that the operator K has the eigenvalue -tin: 

K <t>K,r( r ) = P (£ • L 4- h) 

n 0 \fa-L + h 0 \f 
0 -lj\ 0 &-L + h) \if K .(r)x- K „»(ti,<p) 

&-L + H 0^ \ / St«(r)x^(r),v?) 

0 -(&-L + K)) \\f K (r)x- K}IJ ,(S,(p) 

-ng K .i.r)x K .,r{'S,^P) 


= h [ ) = (-hK)<t> K Jr) 

-KiMr)x-nA^<P)/ 


(10.47) 
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In order to obtain the differential equations for the radial wave functions we 
write down the stationary two-centre Dirac equation (10.29) once more, though in 
greater detail: 


i c 


o -n\ ( o r o \ A d h hn f n o\ 
-1 o)(o a ,) + 7 + T (o -i) 


+ v ‘ p > + 
1=0 






V ( Eg K (r \ 
2-, \iEf K (r) X -«,J 


(10.48) 


where the two-centre potential in the form (10.44) has been inserted. Equation 
(10.48) can be further transformed into 


ihc 


° ~°r \ { + L + 1 ( l 

-o r 0 ) \dr r r \0 


1=0 x 


0 

n 

-i ioo 


moc 2 — E 


E 

zt 1 


g K (r)XK,n 

iMr)X-*,n 


= 0 


hc^fndrX-K,^ + hc^&rX-K^ ~ ^ 7 /AX-M + 0 V iPi9kX^ + { m 0C 2 ~ &) SkXk.A 


zb 00 

d 

K =±1 \ -ihc^-g Ki d r XK,n, - ifc^ydrXw ~ i ftc^g K <j r XK,n + J2Ho v iPi\f^X-K,n ~ {me 2 + E) 

(10.49) 


Later on we shall prove the relation 

= — X — K,fJi > 

from which 


±00 

E 


he he r xk,li “i - he 


+ ^2 V i p i9k,Xk,h + {me 2 - E) g^x^ 


1=0 


= 0 


(10.50) 


±00 r 

E 

Ac=zh 1 L 


t d g K k 

hc-g^x-w ~ te—X-it,fi ~ m-9k.X-k.,h 


~ ^2 V,P,f K X-K,n + {me 2 + E)f K X-K„ 


1=0 


= 0 


(10.51) 


follows. 

According to (10.32) the spinors Xk,h are orthonormal. Using this property 
and multiplying the first differential equation of the two-centre Dirac equation by 
— (Xk iM | and the second one by — (k_« )/ x |, we obtain 
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d f— Tl 00 ±OG 

+ (me 2 -£)&-'£* E s.tejfilw) 

/ =0 K;=i 1 

= 0 , 

d Q- K 00 ±oc 

^dr^ + + (moc 2 + £)/« + ]TV ; ^ /^(X-Tc^I^/Ix-^m) 

(=0 K=±I 

= 0 • (10.52) 

This - in principle infinite - system of coupled radial first-order differential equa- 
tions must be solved numerically in order to determine the energy E. 

As a special case we examine these differential equations for a spherically 
symmetric monopole potential V(r) = V 0 (r) for a special /t; therefore 

hc ^ + nc 7 : ~ hc ^~ (™° c2 -£)s- V 0 g = 0 , 

h£ '& 9 + hc ^ + hc ~ 9~ {moc 2 + E)f + V(J = 0 . (10.53) 

These are the already known radial differential equations for spherically symmetric 
potentials (cf. Example 9.3). The matrix elements {Xk^\Pi\Xk,h) can be obtained 
easily by the usual angular momentum algebra, though we will not pursue that here 
any further. 

However, to conclude our formal derivations we will prove the relation 


3 

°rXK,ti = -X-K.M with rd r = 'Y^x i a i , (10.54) 

;= 1 

where o r is a scalar operator, so o r x K ,n has the same eigenvalues j and /j as 
Xk,m- F* rst we note some properties of the parity of a state. Parity is determined 
by the transformation properties of the spherical harmonics. For the transformation 

(l9, if) -> (7T - -!?, ifi + 7T), 


Y1A 1 r - + = (-\) l Yi, m (ti,ip) (10.55) 

follows. Hence the parity of a state is determined by the orbital angular momentum, 
and is thus given by (-1)'. Therefore we can make the ansatz 


OrXK,n = ax-^ + bx K<fi ■ (10.56) 

As o r changes its sign under parity transformations, b must be zero; thus we can 
write 


7T, = (-1)' = (-1)0 + 1/^) s 

too, with S K = k/\k\ and 

K = / 1 for ; = / - i 

\ - 1 for j = l + \ 

or 

^ _ / k for k > 0 

l — k — 1 for k < 0 


(10.57) 


(10.58) 
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Furthermore we define the 7 related to — k: 

1 — k — 1 for k > 0 , 1 — —k for k < 0 , and 

Z — 7 = 1 = Sk for k > 0 , 

/ — 7 = — 1 = S K for k < 0 , 

holds. Hence we can write the total angular momentum as 

j = i - \s K , 

which we have used already in (10.57). 

Changing parity yields, for constant j, a change of the orbital angular momen- 
tum by one unit. Hence the sign of k changes under the parity transformation, 
too. 

With 


(<r • A)(& • B) = A • B + i&(A x B) 
we find 

a 2 = 1 (10.59) 

and therefore 
a 2 = 1 . 


Consequently, the phase of a remains to be determined. For that purpose we 
choose e r along the z axis and set i? = 0 in the spherical harmonics. With 




2 / + 1 (/ - m)\ 
47T (/ 4- m)\ 


Pt, m ( cost?) 


e imv> 


and 

Am 

p ltm ( X ) = c-i> m (i -x 2 r / 2 —p,(x) 

we obtain for $ = 0 

1 21 + 1 

Moreover, we have 

v r =Vz = (l , with 


&zX i = and 

Thereby we obtain for fixed ji\ 
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If we use 2(i as the eigenvalue of o z , we arrive at the defining equation for a: 
a V2l+l (7 \j |0/x) = 2/xv^T+T (Z \j\0p) . 

It is useful to investigate the four cases j = l ± \ and (i = ±1 separately: 

(1 )j = l + h V= l 2- 

l — l S K = / + 1 ; 

hence it follows that j =1 — j, and 


0 v^(-i)^n=v5rrry|±I . oo.eo) 

From this we get a = — 1. 

(2) ;=Z + I, /x = -i: 

a yji = -VZ + 1 , a = —l . (10.61) 

(3) 7 = Z- i, (i=\: 

J = l-S K = l-l . 

From this follows j =1 + ± and 

ciVl + 1 = -V7 , a = -1 . (10.62) 

(4) ; =Z — i, /x = -i; 

+ 1 = , a = - 1 . (10.63) 

This proves the assertion of (10.54). By numerically solving the coupled differ- 
ential equations (10.52) one finally obtains the Z?-dependent wave functions 0, (r, R ) 
and the energies E t (R). The latter are usually represented by a so-called correla- 
tion diagram, where the energies of the states of the separated systems ( Z x , Z 2 ) 
are connected to those of the combined system (Z\ + Z 2 ). The molecular states 
are classified according to the good quantum numbers j z , with the eigenvalues 
M = \i |> 2 j ■ • •> which are also specified by o, 7r, 6, . . . . In most cases one assigns 
to the molecular state, in addition, the quantum numbers of the appertaining state 
in the combined system R = 0, so that altogether we obtain the designation 

1j,/ 2 <t , 2pi /2 a , 2p 3/2 (J , 2p 3/2 n ... . 

In the case of identical partners in the molecule, there exists a further constant 
of motion. Indeed the parity operator commutes with the Hamiltonian and with 
j z , so that additionally one can distinguish between even (positive parity) and odd 
(negative parity with respect to the centre of mass of both nuclei) states. 

As an example of a relativistic correlation diagram we show the calculated 
binding energies of some bound states in the Pb-Pb system (Figs. 10.2-10.4). In 
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6 = 2si/2<7, e ~ 2^3/2 7T , /i = 3p3/2CT, k = 3d3/27T> n = 4si/i(r> 

C = 2/?i/2<7 , / = 3si/ 2<7, i = 3p3/27T, / = 3^5/2 <T , O = 4pi/2<7 , r = 4cfi/2Cr, 


5 = 4<i 3 /27T , 

£ = 4<i5/2<7 , 
u — 4d$(2 n • 



order to emphasize the various dependences of the energy eigenvalues on the two- 
centre distance R we have chosen different representations of the Pb-Pb correlation 
diagram. First we have displayed in a double logarithmic scale the 21 lowest 
a (full lines) and 7 r states (dashed lines) in the range between R — 15fm and 
R = 3000 fm (see Fig. 10.2). The relativistic fine-structure splitting between the 
states 2p 3 / 2 cr and 2p\/ 2 a at R = 15 fm with the magnitude of about 316.6 keV 
is especially noteworthy. The finite extension of the Pb nuclei was considered 
in these calculations, too; however, the interaction between the electrons was not 


Fig. 10.2. Correlation dia- 
gram for the (symmetric) 
Pb-Pb system with double 
logarithmic scale. The ab- 
scissa shows the two-centre 
distance /?, the ordinate 
shows the binding energy 


Fig. 10.3. Correlation dia- 
gram for the lowest levels 
in the Pb-Pb system in a 
linear scale illustrates the 
strong increase of the bind- 
ing energies for small dis- 
tances ( R — > 0) 
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regarded. Furthermore the delayed crossings between the 3p\/ 2 a and the 2p 2 / 2 a 
state at R — 18fm and between the 4p\ j 2 a and the 3p 2 / 2 a state at R = 15.5 fm 
are interesting. The energy levels in the squares I and II between 300 fm and 
1000 fm are displayed separately with a linear scale (Fig. 10.3) for greater clarity. 
The rapid change in the energy of the most strongly bound electrons in the range 
of small two-centre distances is most impressive if one chooses a linear scale for 
the representation of £ as a function of R (see Fig. 10.4). Here the left column 
(ordinate) represents the binding energies in the combined atom with Z = 164. 
In the range 15 fm < R < 100 fm we find the following energy changes: 330 keV 
for the Isa state, 208 keV for the lp\/ 2 a state and 91keV for the 2s a state. 
For comparison, we mention that the 1 j binding energy in the element Fermium 
(Z = 100) “only” amounts to 141 keV, while it is close to 900 keV for Z = 164. 
The rapid increase of the binding energies of the interior electron shells with 
decreasing two-centre distance R reflects the fast approach of overcriticality in the 
limit R — > 0 of these superheavy systems. This is most important with the “decay 
of the vacuum” in supercritical fields. 


Fig. 10.4. Expanded sections 
from Fig. 10.3. The noncross- 
ing of levels with at least one 
different quantum number is 
clearly noticeable 



11. The Foldy-Wouthuysen Representation 
for Free Particles 


As discussed in the previous chapters, the spinors in the Dirac theory consist of four 
components. In the nonrelativistic limit, for spinors belonging to positive (negative) 
energy states, the upper (lower) two components become large compared to the 
lower (upper) two components [cf. (2.44), (2.72)]. The question arises, whether 
there exists a representation that reflects this property as a general feature, i.e. also 
for large velocities of the particles. 

Thus, we will search for a unitary transformation that, when applied to the 
wave function of a free spin-^ particle, yields for fixed sign of the energy a wave 
function completely determined by only two of the four components. However, 
as mentioned before, in the Dirac representation the wave function is generally 
determined by four components. In the following we will first discuss two different 
possibilities of such a unitary transformation and subsequently we will illustrate 
the transformation of some operators into the new representation, called the Foldy- 
Wouthuysen representation or, alternatively, representation . 

The change from the original Dirac representation to the $ representation can 
be achieved by a unitary transformation 


U = 


0Hf + E p 

\J 2E p ( m 0 c 2 + E p ) 


( 11 . 1 ) 


where Hf = cot • p denotes the Hamiltonian of a free particle, as before. 

If we use a\ = a k , ^ = /3 and take into account that p is a Hermitian operator, 
we can prove that U is indeed a unitary operator: 


U'U = 


2 E„ (m!c> + E„) ' P + m ° c2 + ^ ■* + "*<? + e p) 

2E, (mtct + E„) {C& ' ' ^ + ^ ^ ^ P + m ° C 2 + E p) 

2 + E,) ( ^ 2 + ^ ' P 

+ ca. • p/3(moc 2 + E p ) + (m 0 c 2 + E p ) 2 j 

2 E (me 2 + E ) ^ ~ + ^ + 1E P m ^ 2 + E D = 1 • ( ! ] 


Sometimes U is specified in a slightly modified form, which we quote here for 
completeness: 
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m + e„ 


pa • p 


\J^E p (m 0 c 2 + E p ) \J^E p (m 0 c 2 + E p ) 

A 


+ 


2 E n 


1 


moc 2 + E P 2 


1 1 ( x , m oc 2 

2 V E n 


1 +P-, 


P /l 




moc * 


m 0 c 2 + E p 
2E p 


J 


(11.3) 


where p = |p|. 

The wave functions in the $ representation are related to the original represen- 
tation by 

<j> = lhp (11.4) 


Simultaneously, all operators need to be transformed according to 

A# = 0A0* . 


(11.5) 


Since the operator of momentum p commutes with U , it remains invariant 
under the transformation, i.e. 

P*=P • (11-6) 

With respect to (11.5) we transform the Hamiltonian Hf into <P representation: 

P (cd • p 4- Pmoc 2 ) +E p - 2 \ (ca • p + f3m 0 c 2 ) P + E p 

— (ca • p + /3m 0 c ) - 


H# = 


yj2Ep ( moc 2 +Ep) 


\j2Ep (/ n 0 c 2 +E P ) 


(11.7) 


It is useful to evaluate H 2 first. Using the relation P&k = —&kP, we find 

H 2 = (cd • p + j3moc 2 ) (cd • p + Pmoc 2 ) 

= c 2 p 2 + c(d • p)Pmoc 2 + pmoc 2 ca • p + /MqC 4 = c 2 p 2 -I- /MqC 4 


= E 2 = E 2 


where the last equality is justified if the operator H 2 acts on plane waves. Further- 
more we evaluate 


f3HfP — p (cd • p + Pmoc 2 ) p = — cd • p + Pmoc 2 = 2moc 2 p — Hf , 

{PE 2 + EpHf) (HfP + Ep) 


and hence, we finally have 
H<t> = 1 


2 E p (moc 2 + E p ) p 

1 


2E p (m 0 c 2 + E p ) 

1 

2 E p (m 0 c 2 + £),) 

2 E 2 p (m 0 c 2 + £ p ) 
2£), (woe 2 + £ p ) 


(PE 2 Hfp + PE 2 + E 2 P + EpH ( Ep) 
(E 2 2m 0 c 2 p + 2pE 2 ) 

= E p p . 


( 11 . 8 ) 



The Dirac equation 

Hf'ip = e'lp 
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(11-9) 

now reads 

(3E P 4> = e<j) . (11.10) 

This concept has already been illustrated in Chap. 1 for the Klein-Gordon equa- 
tion (cf. the Feshbach-Villars representation). Now we want to introduce a differ- 
ent method to find the Hamiltonian in the ^ representation, originally proposed 
by Foldy and Wouthuysen. Again the idea is to search for a unitary transformation 
U which will remove from the Dirac equation all operators of the type & that 
couple the large to the small components. We will term any such operator an odd 
operator , e.g. a, 75, whereas an operator that does not couple large and small 

A A A 

components, such as H, (3 , 27, is called an even operator. 


We write 


<j) = U x/j = e lS ^ , where 

(i1.1i) 

uW =c~ iS c iS = n 

(11.12) 

and S is a yet unknown Hermitian operator. From 


^ d(j) 

(11.13) 

it follows that 


H<p e 1 ^ ip — i e 1 ^ = ^ Hip — e 1 ^ . 

dt dt 

(11.14) 

Multiplication by e~ lS from the rhs results in 


= e s H e - h— . 

dt 

(11.15) 


If S is explicitly known, this relation enables us to determine the Hamiltonian 
in ^ representation H<p from the original Hamiltonian H . In the following we will 
restrict ourselves to time-independent transformations 5, i.e. we presume dS /dt = 
0. For S we try the following ansatz (note that the field-free case, as considered 
here, implies that one simply may move over the momentum space representation, 
where p = p)\ 

s = - (-Z — ) 0a • pw (— ) , (11.16) 

\2m 0 c J \m 0 c J 

where the function u(p/moc) will be specified later. It is obvious that S is Her- 
mitian. For the Hamiltonian of free Dirac particles in the $ representation it holds 
that 

H<p = e 1,5 ( cot • p + (3moc 2 ) e~ ,S = e‘^/3 (c/3a • p + m^c 2 ) e -1,? 

= e i§ (3e~ lS j3 (ca • p + fimoc 2 ) , 


(11.17) 
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since S commutes with 0p • a. Using the relation 

em-py = (-ir(pa-p) n p , 


we can write 


where we have expanded e ^ in a power series. Hence (11.17) becomes 
H<p = e 21 ^ (ca • p + pmoc 2 ) 


Next, we expand e 2 ' 5 , i.e. 

e 2 '^ = exp ( — * P lo f 

\ mo c \m 0 c 


y( 1 \ n (0a-p) n 

f^ 0 \moc) n\ 


j + P& ‘P ^ + / 0a-p \ 2 ^ + f -p \ 3 


moc ) 2! \ m 0 c ) 3! 


(3a ■ p\ uj 4 

-^ r ) ^ + - • 


With regard to the expansions 


- i 0’/^) 2 “ 2 | < p /* w ? iS 
\moc J 2! 4! ’ 

P \_P & -P T P .. [( p/moc V] 3 _ 

sin i uj I — 1 uj -j- # , . 

P \moc J p m 0 c 3! 


and (0a • p) 2 = -p 2 , we see immediately that H<p can also be written as follows: 

= [ cos (^“) + sin 0^“)] [c& ' p + /W2] 

— 0 [woc 2 cos ( -^—S\ + cp sin ( 


(11.23) 


a-p p \ 2 ■ P 

H pc cos uj — moc sin { - J — uj 

P L V m o c J \moc 


We will have reached our aim to remove all odd operators, if we can specify ui 
such that the bracket ~ a • p/p is eliminated. Rearranging this expression yields 


I P \ 2 ■ I P 

pc cos u 1 — moc sin ui 

\moc J \m 0 c 


= pc cos 


P \ r, m 0 c sin[(p/m 0 c)cj] 

LO I 1 — 

moc J [ p cos[(p/moc)u] 


(P \ moc t ( p 

— pc cos u> 1 tan - J — oj 

\moc J [ p \moc 


(11.24) 
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If we now choose 


m 0 c ( p 

uj = arctan 

P \m 0 c 


it follows that 


pc cos 


~UJ 


m 0 c 


, me l Ip 

1 tan ( arctan ' 


m$c 


= pc cos 


m$c 


uj)[ 1 - 1 ] = 0 , 


i.e. with our choice of uj the odd part of H<p vanishes and we get 

H# = (3 


moc 2 cos ( —— — uj ) + cp sin ( 

rriQC J \m 0 c 


Using the trigonometric relation 


x 1 

arctan x = arcsin — 7 == = arccos 


\/l + x 
which holds for x > 0 , then 

1 


vTT. 


H# = {3 


= (3 


moc 


m 0 c 


+ cp- 


p/m 0 c 


yj 1 + p 2 /m%c 2 yj\ +P 2 / 


mfic 2 


m 0 c 


+ 




\Jp 2 + m^c 2 \Jp 2 + m%c 2 

flcsjp 2 + m 2 c 2 = f3E p . 


(11.25) 


(11.26) 


(11.27) 


Obviously this result is the same as in our previous calculations [see ( 11 . 8 )]. 

Next we calculate the sign operator A in the ( P representation. A was defined 
as [see (2.48)]: 


A 


Hf ca-p + j3moc 2 


c^Jp 2 + m%c 2 
or in the momentum representation: 
cat • p + $mQC 2 


(11.28) 


A = 


(11.29) 


By use of H& we get 

A* = 0A0' = $ . 


(11.30) 


For the following calculations it is necessary to know the C P representation for the 
operator &. Therefore we calculate 
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(3a -pc + m 0 c 2 + E p . * 2 , r \ 

= t/at/ T = — — — -r-^a (a • pc (3 + me + E p ) 

2 E p [mc 2 + E p ) 

c 2 (a • p)a(a • p) ^ c(3(a • p)a + a(a: • p)c(3 a ( m 0 c 2 + E p ) 


2E p (mc 2 +E p ) + 2£ p 

and replace 

c 2 p 2 = E 2 - m^c 4 = (Ep + me 2 ) (Ep - m 0 c 2 ) 
By use of the commutation relation 
a k ai + d/dt = 26 u , 
follows 

c 2 (a • p)a(a • p) 2c 2 p(a • p) 


2 E„ 


+ 


7 7 A 

c z p z a 


2E p (m 0 c 2 + E p ) 2E p (me 2 + £ p ) 2E p (m 0 c 2 + E p ) 


and 


cj3(a • p)a + a(a • p)c(3 _ c(3(a • p)a + 2c pp — c(3(a • p)a c$p 
2 E p 2 E p E p 

Next we collect terms: 

(E 2 - mfic 4 ) a + a (mfic 4 + E 2 + 2 E p me 2 ) a (2 E 2 + 2E p me 2 ) 


and finally get 
a<p = a — 


2 Ep (me 2 + E p ) 


c 2 p(a • p) c$p 


2 Ep (me 2 + Ep) 


Ep (Ep + me 2 ) Ep 


= at, 


(11.31) 


By a similar calculation it can be shown that the even part of the position 
operator is given in the ( P representation by 


r , rVr iftt Hc 2 (E X p) 

[r]< p = U[r]U' = r t tt- , 

2 E p (E p + me 2 ) 


where 


1 , » ihcA A i hc 2 p 

[r] = - (r + ArA) = r + — -a - 


2 E 2 


while the odd part reads 


r i ,,f i„t ihc ( , c 2 (3(a • p)p 

{ r }& = U {r}U' = — ( a(3 + 


2 E„ 


Ep (E p + me 2 ) 


(11.32) 


(11.33) 


(11.34) 


with 

{r} = j(r - ArA) . 

For calculation of r<p , the relation 

r# = OrO* = r + ihO (V p 0') r-iW p 
is quite useful (see Exercise 1.18). 


(11.35) 
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Finally we want to transform explicitly the Dirac plane waves into the $ rep- 
resentation. The projection operators f[+<p and 77_^, which project out from an 
arbitrary Dirac wave the states with positive or negative energy respectively, read 
in the $ representation: 


i7+* = |(i + p) , n = i(i - p) 
where ^(1 + P) has the explicit form 


(11.36) 


1 

+ 

z 1 

0 

0 

°\ 

1 -p 

/° 

0 

0 

0 

1 

0 

0 

0 

0 

0 

2 1 

0 

0 

0 

0 

’ 2 

0 

0 

1 


\0 

0 

0 

0/ 


\0 

0 

0 


(11.37) 


With these relations we may show that in the $ representation the wave function 
for a given sign of energy is completely described by two components. In the 
usual representation the orthonormalized wave functions for states with a given 
momentum in the z direction, given sign A(= +1 or —1) of energy and given spin 
projection & *p — \ or — follows from (2.34): 

/ 1 \ 

0 

cp 


VV,A,l/2 - 


^P, A,~ 1/2 - 


moc 2 + e 


2e 


moc 2 + e 


l£ 


moc 2 + £ 
0 
0 
1 
0 

-cp 


Jpz/h 


(27T h) 3 / 2 


£ — A E„ 


p-a=p 


^ipz/h 

(inR) 3 / 2 


p- a = -p 


(11.38) 


x mQC 2 + s 

Transforming according to 0 — U ip, we need 


(3a = 


0 

a 


cr 

0 


0 

—& 


cr 

0 


and 


(J 7 = 


1 0 
0 -1 



and, using c 2 p 2 = ( E p + moc 2 )(E p — moc 2 ) we obtain 


<; b=U'i/j = 


0 

0 

Vo/ 


l 


Jpz/h 


e ipz/n 


( 


2 E p {l-Khf/ 2 


c 2 p 2 


\ 


ni(,c 2 + £ 

0 

-cp + cp 

0 


+ moc 2 + Ep 


\ 


(2nh) 3 / 2 


a • p = p and 


(11.39) 
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1 - 1/2 


1 gipz/h 

2 E p (2nh)V 2 

/°N 


( 


c 2 p 2 

2 h ttloC 2 + Ep 

m 0 c “I - Ep 

0 

cp — cp 


0 


\ 


J 


1 
0 

Vo / 


gipz/fi 


( 2 tt ^) 3 /2 

\ V' / 

The states with negative energy read 

/ 1 

E p - moc 2 

Vv, -1,1/2 — \l 2pr 


(11.40) 


\ 


0 

cp 


Q ipz/h 


E p - moc 2 

^P,- 1 , — 1/2 — \ 9 c- 


E p - m 0 c 2 

Vo/ 
/ 0 


' p '* = '’ and 


2£„ 


1 

0 

cp 


\ 


gipz/h 


' E p - m 0 c 2 / 

From (j) = U & we calculate 


Q i / 

(2tt^) 3 / 2 ’ P ,<T = ~P ■ 


( F C P 2 + ffJ ° C ' 2 + £ /> \ 
£p - Woe 2 1 

0 

(/n 0 c 2 + Ep) (-cp) 


1 \jE p — niQC 2 t'P z ! h 

<P P ,-x,m = W p ^/ Ep + m Q c 2 (2 itK)V2 


-cp + 


The first element of this column matrix is zero, the third element results in 
—2 cpE p /(E p — moc 2 ) and we get 


y/Ep - mpe 2 _ E p - mpc 2 

y/Ep + m 0 c 2 y/Ep + m 0 c 2 y/E p - m^c 2 

_ E p — moc 2 E p - moc 2 
\/Ep- (me 2 ) 2 CP 

This yields 


tp- 1,1/2 


°\ 

0 

-1 

V 0/ 


Q ipz/h 

( 2nh ) 3 / 2 


and, similarly, 


<t>P, -1,1/2 


0 \ 

0 

0 

V -i / 


e ipz/fi 

(27T/i) 3 / 2 


&‘P = P , 


& • p = p 


(11.42) 


(11.43) 
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(11.44) 


Now, ^ can always we written as 

*-* ++ *-=($) ■ 
and using the projection operators one obtains 

= (it + .) <P = , *- = (£.,) * = , (11.45a) 

where w(p ) and v(p) can, in fact, be written as two-component functions 

(11.45b) 


wip) = ( W ' ) and v(p) = ( V ' 
\w 2 J \v 2 


11.1 The Foldy-Wouthuysen Representation in the Presence 
of External Fields 

If external fields are coupled to a Hamiltonian H , then H always contains parts 
coupling together the free positive and negative energy solutions. In the case of 
weak fields the role of the odd parts of this operator can be neglected. The Foldy- 
Wouthuysen transformation, as an approximation of the exact solution, is therefore 
only applicable to weak fields, where it systematically improves the approach. In 
performing the transformation we are guided by the example of the case of free 
fields. Again, we split up the Hamiltonian into 

H = /3moc 2 + i + O , (11.46) 

where j3 corresponds to the remaining even part of H (i <=> even) and O to the 
odd part ( O <£=> odd). Inclusion of external electromagnetic fields yields 

0 = cct(j)— and + i = eV (r) , (11-47) 

where V(r) is the Coulomb potential. In analogy to the case of free fields (11.16) 
we introduce a tansformation of the following kind: 

H ' = e i5 #e‘ i4 (11.48) 

with the intention of minimizing the odd parts of the Hamiltonian H\ or even to 
make them vanish. In analogy to (11.16) we choose 

5 = —^$0 , (11.49) 

2moc z 

i.e. S shall not explicitly depend on time. If the fields occuring in (11.47) depend 
explicitly on time (and thus also the Hamiltonian) the transformation S must gen- 
erally also be time dependent. Then it is usually not possible to construct S in a 
way that all odd parts of H f disappear in any order. Therefore, we restrict ourselves 
to a nonrelativistic expansion of the Hamiltonian H f into an exponential series of 
1/moc 2 . More precisely, we take into account only terms of order 
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/ kinetic energy \ 3 (kinetic energy) • (field energy) 

\ me 2 ) 


mfic 4 


We expand the exponential function into a power series 

= /> + i[M]_ + ^[s, [■§, 

To verify this we write down the second-order terms of S separately, 
« (-i) 2 i 2 S 2 . . . 

H ~2T S + ~2T H + iSH (~') S 

= j [5, [M]_]_ = t [§,SH-HS]_ 

\2 

1 / A A A A A A A A A a A a \ 

= - (S 2 H - SHS - SHS + HS 2 ) . 


(11.50) 


(11.51) 


We can check the validity of this general commutator expansion (11.52) by con- 
sidering the operator function 


r(A) = e^ e -«=f;^r") 

nS" ! \ dA ”/j-o 


and by verifying 
d F 


— (A) = e iAs i[S,tf]_ e 


— iAS 


dA 
as well as 
d n F 


^=e^i" S, [M]_ 


-iAS 


Thus the validity of (11.50) easily follows for A = 1. 

In the following we make use of the relations 

(30 = -0(3 and 0S = e(3 . (11.52) 

Expanding the Hamiltonian H f into powers of 1/raoc 2 , we restrict ourselves to 
terms up to order l/m^c 6 . Then we can write H' as 






+ 


1 

24 


5, 


5, 


5, [Sjm 0 c 2 _ 


+ ... 


(11.53) 


Up to terms of order “one”, H 1 is given by 
H' = $moc 2 + i + O + i [5 , /3] _ mQC 2 . 


(11.54) 
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Now we calculate the various commutators of S and H : 


* M j ((-^ d ) ^ +d+i ~>- ^ +6+t '>{^ 6 )) 


1 ^ 2^ 2m§c 2 


1 A A 1 A A 1 A 1 A A ^ 1 

PO 2 -- -POZ+- 0 - 

2m§c l 2 


2moc 2 


po 2 + 


. po 2 p . 

^~0 + ^ + ^~[0+] 


;2 r 


'S,[S,H}_ 


rriQC 2 IrriQC 2 

1 

2 


2m$c 2 

(11.55) 


epO 


1 $d)(-d + ^ + -jL;[d,e]. 


2m§c 2 


m§c 2 2moc 2 


- [ -O + ^ [0,e]l f -^-, 'po 


m$c 2 2/rioc 2 


2 \2moc 2 

+ 


po 2 + 


2/WqC 4 


O 3 + 


4m 2 c 4 


2moc 2 1 
0[0,e]_ 


1 po 2 + 1 


2/tIqC 4 


O 3 - 


4 m^c 4 


[0,e]_ a 


r 

3! 


2moc 2 

l A A l A A l r A 1 


f. [S, [5,#]. 

= K ^ d3 + + [d,[6 ' < ‘ L] - 

d,[d,[d,i].]_ 


(11.56) 


6/MqC 4 


d 3 - 


1 po 4 - 1 


6niQC 6 


48 m^c 6 


P 


We take into account only terms of the order of l/m^c 6 ; thus we get 


r 

4! 


S , 5 




24»JqC 6 


,30' 


(11.57) 


(11.58) 


Next we collect the terms of (11.54-11.58), which yields 
H' = Pm 0 c 2 + 1 + 0-0 + ^-^P [0,e}_ + — ^ pO 2 


2moc 2 


m§c l 


2moc 


;P 0 : 


- 4 ^ [d ’ e1 - ] - - 4^ 3 + 4^ 3 - 4^ 

O, [0,0, €]_}_' 


^24/MqC 6 ^" 48mQt 6 


P 


= P ( m 0 c 2 + 


1 


2moc 2 


O 2 - 


1 O 4 ] + t - 1 


8/WqC 6 


8/n^c 4 


[0,[0 ,£]_]_ 


: ptllQC' 2 + £ + O 


( 11 . 59 ) 
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At this step we might simply argue: Let us omit all odd terms of H i.e. the 
last term of (11.59) with the odd powers of O. But we may also formally reduce 
the odd part of H' by further Foldy-Wouthuysen transformations. At this point we 
perform a second transformation 


A . 1 A A f 1 

S' = — -(30' = -^-^(3 


2moc 2 


IntQC 2 \2ntoc 2 


1 




1 


3/WqC 4 


O 


(11.60) 


and obtain 

= + . (11-61) 


The term proportional to O' 3 contains large powers of 1 jm^c 2 and therefore it can 
be neglected; thus H" is given by 

H" = /3moc 2 + t' + — -(3 \0' , e r ] _ = (3moc 2 + e + O" . (11.62) 

O" is of the order of X/m^c 4 . To eliminate O" we apply a third transformation 


H 


S" = - 


- e s " H" 


with 


2moc 2 


(30' 


(11.63) 

(11.64) 


By neglecting the odd terms proportional to 1 /m^c 6 this yields 
H'" ~ (3m 0 c 2 + e' 

= 4 + ai 6 ' - 4^ 64 ) + 1 ’ 4^ [6, l6, eLl - 

— . (11.65) 

To illustrate this procedure we now calculate the various terms of H<p explicitly. 
Here we make use of the following, already known relations for two arbitrary 
vectors A and B : 


0 &-A)(a-B) = A-B + it:-(AxB ) , 
with which we obtain 


(11.66) 


2 


= 4 (* - + - ^) x 


2 m 0 


(p - -A\ - ■ 16 27 • (p x A + A x p) . 
\ c J 2moc 


Here p = -iftV and curl^A) = /curl A + grad / x A. This expression can be 
further simplified as 
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IniQC 2 


; 0 ' 


= ^( p - e c A J 


eh ~ 

r*(Vx A) 

2moc 
eh 


c / 2moc 
Next we look at the commutator [O, e]_ 


EB 


(11.67) 


8^7 [ °' e ' ] - = ( C& ■ i p ~ ‘c A ) eV ~ eVc& ■ (*> “ ‘c A )) 


8/jJqC 4 


i ehca • W = 


i eh 

8 /WqC 4 


cq. • E 


( 11 . 68 ) 


and, according to (11.65), at the commutator [O , a • E] _ 

a ieh „ „ 

O, - — ^—jcct • E 


8 mfic 4 


= < col 


[t> - e -*) 

„ i ehc 2 


i eh 
Smfic 4 


c&‘ E — 


i eh 

8 /WqC 4 


cat • i? 




8 WqC 4 


a • E 


( - ih(V • E) - ihE • V + i hE •V + iE-(pxE)-\E-(Exp)) 


a -p, 

ieh 

8 mfic 2 

ieh 

8 m 0 2 c 2 

ell 2 c 2 ich, 2 c 2 » 

div E + - — — 7 E • curl E + 


( - i h(V • E) + hE • (V x E) - iE • (E x p) - iE • (E x p)) 


eh 


Sm^c 4 


8 wqC 4 


4itIqC 2 


E- (Exp) 


(11.69) 


Adding the various contributions, we have 
H<p = H'" 


=/5 ( w2+ i(^-H 2 -^ 4 ) 


+ eV - 


1 /v ieh 2 c 2 * 

Peh(Z-B)- ir ^Z-(cur\E) 


2m§c 


Stride 4 


eh £.(Exp>-^div£ 


4wqC 2 


8 /WqC 4 


(11.70) 


Now it seems advisable to discuss the individual terms of (11.70). The terms in the 
first parenthesis result from the expansion of [(p — e/cA ) 2 + mo] and describe 
the relativistic mass increase. Subsequently follow terms describing the electrostatic 
energy and the magnetic dipole energy. The next two terms, which actually are 
Hermitian only if taken together, contain the spin-orbit interaction. This can be 
seen particularly clearly under the assumption of a spherically symmetric potential 
with curl E = 0, when 


« 1 dV - 1 <9V „ „ 

E-(Exp) = — — E-(rxp) = ——(E-L) 
r or r Or 


Thus we have 


K 


eh 1 dV 


spin-orbit — 


4/MqC 2 r dr 


(E-L) 


(11.71) 


(11.72) 
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This spin-orbit interaction is responsible for the splitting of states with the same 
orbital angular momentum /, but with different total angular momentum j. This 
interaction also is very important in nuclear physics for the classification of single- 
particle states of nucleons, but there the spin-orbit interaction is not of electromag- 
netic origin. The last term of (11.70) is the so-called Darwin term. It results from 
the Zitterbewegung of the electron over a region of a magnitude comparable to the 
Compton wavelength (2.61). For point-like nuclei we can write 

div22 = — A(j) = 4i tq c = 4neS(r ) . 

Considering now that for nonrelativistic wave functions only the s state do not 
vanish at the origin, and thus only #),s(0) ^ 0, one immediately realizes that for 
light atoms the Darwin term mainly results in an energy shift of the s levels. 

Finally we look qualitatively at the level scheme of the hydrogen atom (see 
Fig. 11.1). The fine-structure splitting due to spin-orbit coupling is quantitatively 
the largest contribution of the relativistic theory compared to the nonrelativistic 
Schrodinger description of the hydrogen atom. The previously discussed contribu- 
tions of the Hamiltonian yield good agreement between the theory and experimental 
data up to the year 1947, when as an additional effect the hyperfine splitting had 
to be taken into account. This splitting of the two normally degenerated spin lev- 
els. In 1947 Lamb and Retherford discovered a further shift of the 2 s\/ 2 level 
against the 2 /?i/ 2 level (indicated in the figure), which should be degenerated if 
exact solutions of the Dirac equation are considered (refer to Exercise 9.6 and 9.7). 
The physical origin of this quantum electrodynamical effect is the interaction of 
the electrons with the fluctuations of the quantized radiation field (self-energy and 
vacuum polarization), the so-called Lamb shift. 1 


A 


E 


3si/2 


3j>3/2 

3j’1/2 


3^5/2 

34j/2 


2 * 1/2 


2p3/2 - 


Lambshift 
2 / 21/2 


fine structure 


Fig. 11.1. Qualitative scheme 
of the energy levels of the 
hydrogren atom 


1 .<? 1/3 - hyper-fine splitting 


1 This is discussed in more detail in W. Greiner, J. Reinhardt: Quantum Electrodynamics , 
2nd ed. (Springer, Berlin, Heidelberg 1994). 
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Until now the solutions of the Dirac equation with negative energy have been a 
puzzle. Attempts similar to those we performed with the solutions of the Klein- 
Gordon equation, where the energy turned out to be positive (by the Lagrange 
formalism) for solutions with positive and negative time evolution factors, proved 
unsuccessful (cf. Exercise 2.3). Solutions with negative energy appear almost ev- 
erywhere when we are concerned with processes of high energy or with strongly 
localized wave packets (see Exercises 8.4, 8.5). At this point we have to confront 
this dilemma and find a proper solution! 

The existence of solutions with negative energy in the previous interpretation, 
as single-particle states of the electron, obviously leads to trouble and physical 
nonsense. Let us consider the electrons in an atom, the spectrum of which is once 
more given qualitatively in Fig. 12.1. 

The bound states directly below the positive energy continuum, with E < rrtoc 2 , 
are in general in very good agreement with experiments. It is beyond any doubt 
that these are the bound states of the (one-electron) atom. 

An electron in the lowest atomic state (Is) could lose more energy by con- 
tinuous radiative transitions. Thus an atom would be unstable and, because of the 
continuous emission of light, a radiation catastrophe would occur. However, such 
effects have never been observed! If this decay could happen, our world could not 
exist. Hence we have both a principle to uphold as well as a practical problem to 
solve to avoid electrons falling off into the states of negative energy. Neglecting 
the radiation field, the bound-state electrons would be stationary. By switching on 
the field (of course it is always “switched on”), and the use of radiation theory 
and of the wave functions found in Exercise 9.6, an infinite transition probability 
is obtained particularly if one takes into account the infinitely large number of 
final states in the lower continuum (see Exercise 12.1). But this is of course sheer 
nonsense! We must find a new physical idea to remove this dilemma, and, in its 
original form, this was provided by Dirac. 1 He assumed all states of negative 
energy to be occupied with electrons (see Fig. 12.2). 

The vacuum state is defined by the absence of real electrons (electrons in 
states of positive energy), and all the states of negative energy are filled with 
electrons. The vacuum state is the energetically deepest stable state , which can 
be realized under certain conditions (constraints such as, e.g., external fields). In 
the absence of the field the vacuum represents the lower (negative) continuum 
(it is also called the “Dirac sea”), whose states are completely occupied with 

1 P.A.M. Dirac: Proc. R. Soc. (London), A 126, 360 (1930); see also J.R. Oppenheimer: 
Phys. Rev. 35, 939 (1930). 



Fig. 12.1. Illustration of the 
radiation catastrophe of a ra- 
diating electron in an atom. 
It falls deeper and deeper 
by continued radiative transi- 
tions 




m 0 c ; 


0 + 


ft 


states of 
positive energy 



ft 


states of 
negative energy 


Fig. 12.2. In the hole theory 
the states of negative energy 
are occupied with electrons 
(X). According to the Pauli 
principle, each state can con- 
tain two electrons, namely 
one with spin up and one 
with spin down 
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electrons. This physical assumption of the negative energy continuum filled with 
electrons has very important consequences. We perceive at once that the radiation 
catastrophe mentioned above is now avoided because of the Pauli principle, which 
forbids transitions of real electrons into occupied lower states. On the other hand an 
electron of negative energy can absorb radiation. If the energy hw of the absorbed 
photon is greater than the energy gap (hw > 2moc 2 ), an electron of negative energy 
can be excited into a state of positive energy (see Fig. 12.3). 

In that case we get a real electron and a hole. The hole behaves like a particle 
with charge +\e\, because it can be annihilated by an electron (e“) with charge 
— \e\\ thus the hole is the antiparticle of the electron and is named positron (e + ). 
Obviously the creation of an electron and electron hole by photons is to be identified 
with electron-positron pair creation , with a threshold energy of 


Fig. 12.3. A photon of en- 
ergy hw > 2moc 2 creates an 
electron-electron-hole state. 
The hole is interpreted as a 
positron; hence the process is 
just e~ - e + pair creation 


hw = 2m<yc 2 


( 12 . 1 ) 


Alternatively, we call the process where an electron drops into a hole, thereby 
emitting an appropriate photon, pair annihilation (or matter-antimatter annihilation, 
or e + e" annihilation). The energy balance of the pair creation is 


— ^electron with pos. energy ^electron with neg. energy 

= ^+c \/p 2 + m 0 c 2 c 2 ^ ^ — c \J p a + m 0 c 2 c 2 ^ 

= ^electron “1“ ^positron ? 


( 12 . 2 ) 


and we can associate the electron with the positive energy 


^electron = +cfp* + tn%C 2 . 


(12.3) 


So far this is nothing new. What is new, however, is that according to (12.2) we 
have to give the positron (electron of negative energy) a positive energy, namely 


^positron = +C^Jp' 2 + m^C 2 . (12.4) 

In the special case of vanishing positron momentum p\ it follows that the positron 
has the rest mass 


(^positron) at rest — m$C . (12.5) 

Therefore positrons (electron holes) have the same rest mass as electrons but op- 
posite charge (as we have shown above). Similarly we obtain the following mo- 
mentum balance: the photon has momentum hk , which is distributed to electrons 
and positrons. We conclude that initial total momentum = final total momentum, 
i.e. 


Hk + (p )electron with 
Hk — (p)electron with 


neg. energy 
pos. energy 


— (P)electron with 
( P )electron with 


pos. energy 
neg. energy 


or 


and write 


( 12 . 6 ) 

(12.7) 


hk — (p)electron H" (p )positron 


(12.8) 
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Hence, the positron has the opposite momentum to the electron, and negative 
energy. Indeed, a missing electron (negative energy) with momentum p' should 
behave like a positively charged particle with the same mass and opposite momen- 
tum. In this way we can easily explain the previous paradoxical fact that the mean 
particle velocity of electrons with negative energy equals 

c (P)electron with neg. energy (12 9) 

E p 

[see (2.62) ff.], since we are now dealing with the positron velocity, i.e. 


-C (p)electron with neg. energy "be (P)positron 


( 12 . 10 ) 


The most important result of the hole theory is that it is the first theory which 
introduces a model for the vacuum, i.e. for particle-free space, in a naive sense. 
The vacuum is here represented by the Dirac sea, which consists of the states 
of negative energy occupied by electrons. This vacuum should have zero energy 
(mass) and no charge. However, it is clear that the model in this simple form does 
not have these properties. The states occupied with electrons of negative energy 
together have infinitely large negative energy and infinitely large negative charge. 
Both have to be renormalized to zero, i.e. the zero point of energy and charge 
is chosen in such a way that the Dirac sea has no mass and no charge. This 
renormalization procedure is not very satisfactory (aesthetically) but it is feasible 
(though soon we will discuss a better model, see Sect. 12.4). At this stage we find 
the qualitatively important fact that the vacuum can be modified, for instance, by 
the influence of external fields: these can deform the wave functions of the states of 
negative energy occupied with electrons. Hence they produce a measurable vacuum 
polarization with respect to the state without external fields. 

Let us stress the point that the hole theory is a many-body theory, describing 
particles with positive and negative charge. Indeed, infinitely many electrons are 
needed to constitute the Dirac sea. The simple probability interpretation of the wave 
functions acclaimed in a single-particle theory cannot be true any longer, because 
the creation and annihilation of electron-positron pairs must be taken into account 
in the wave functions. 

Resume. In early relativistic quantum mechanics, the Klein-Gordon theory was 
dismissed because it did not seem to allow a proper probability interpretation, and 
also, the appearance of states of negative energy was problematic. Hence Dirac 
formulated the Dirac equation with the intention of establishing a true relativistic 
single-particle theory. As we now know, the difficulties with the negative energy 
states of the Dirac equation almost of necessity demand a many-body theory (hole 
theory), and therefore the question arises whether it should also be dropped. On the 
other hand, though, we have been very successfully applying the Dirac equation to 
many problems (e.g. prediction of spin, of spin-orbit coupling, of the g factor, of 
atomic line structure). Additionally, by extending the single-particle Dirac theory 
to the hole theory, it has new, impressive success: 

The prediction of the positron as the antiparticle of the electron was experimentally 
confirmed in all points, including the correct threshold (12.1). Equally, the vacuum 
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polarization mentioned above, as well as many other effects, were confirmed by 
experiments. The previously discussed Zitterbewegung does not vanish in the hole 
theory, though because of the filled Dirac sea one might naively suppose that the 
states of negative energy are not available due to the Pauli principle; however, 
there still exists an exchange interaction and a related scattering. Accordingly, 
virtual electron-positron pairs are continuously created in the vacuum. The original 
electron (in a bound or free state) can fill up the virtual hole in the Dirac sea, and the 
other electron takes its place, this exchange interaction causing the Zitterbewegung. 
In its physical content the Dirac theory (extended by the hole theory) is the basis of 
quantum electrodynamics, which currently is the best-established theory of physics. 

Dirac originally formulated his equation with a particular set of motives. The 
relativistic theory for spin — j particles developed in this way had to be reinterpreted 
(hole theory) to get rid of its contradictions and became even more successful. 
Although the original motivation was not plausible from our present point of view, 
it nevertheless apparently showed the right direction. Often in the history of physics, 
a great success is achieved after a series of more or less erroneous investigations, 
which finally lead to new concepts and insights. In the following, we retain the 
Dirac equation and its reinterpretation by hole theory and extend it, also improving 
on the accuracy; however we drop the one-particle probability interpretation (except 
for illustrative purposes). 


EXAMPLE 


12.1 Radiative Transition Probability from the Hydrogen Ground State 
to the States of Negative Energy 


Problem. Estimate the transition probability for a radiative transition from the 
hydrogen ground state into an electron state of the empty negative continuum with 
— niQC 2 > E > —IrriQC 2 . 


Solution. This problem is to show the idea that the electron states with nega- 
tive energy have to be considered as occupied. The Dirac theory yields for the 
electromagnetic interaction (h = c = 1 ): 

Hi at = -e\p(x ) • A(x) 

= —e$ (x)atxj)(x) • A(x) . ( 1 ) 

We quantize the electromagnetic field in a box of volume L 3 and insert the expan- 
sion 




a k' 






( 2 ) 


Here a^ a and aka are the creation and annihilation operators for photons of mo- 
mentum Hk and polarization cr. The initial and the final states are of the following 
form: 


(1 photon with k,a\ ■ (tp{\ = (f | 


(3) 
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and 


|i) = |'0i) |photon vacuum) . 

With this it follows that 

= ~\J JT~ e J d 3 xii}l(x)aipi(x)-£ k ae' k ' x ■ 

We use the dipole approximation exp(ifc • x) ~ 1 and the relation 


ca = ^[H,x]_ , 

which follows from the Dirac equation. Hence 

(f |A„«|i) = e(E { - (/ d ' x $ (*)*M*)) • 


Then Fermi’s golden rule yields 
/ transitions^ 2n 

\ time J h 


EE E e 2 < £ i - £ '> 2 


p r— 3,4 J k,a 


2n 

L 3 U)lr 


X d 3 x^f t (x)xr/>i(x) • e k ,o 


S(uJk — E[+ Ef) 


(summed over all p with moc 2 < E(p) < 2moc 2 ). 


(4) 

( 5 ) 

( 6 ) 
(7) 


( 8 ) 


Example 12.1. 



The first bracket of (8) contains the summation over all permitted final states 
of the electron: r = 3,4 characterizes the two spin directions of the final negative 
energy states. Now we can chose the polarization vectors e k , a * n such a way that 
at least one of the polarization vectors, say e k ,i, is orthogonal to k (see above 
figure). From now on we set h = c = 1 . With 




(9) 


one obtains 

/ transitions^ _ 1 
\ time J 2n 


[E]/// 

L p,r J J J J 


J d 3 x xl>f\x)x'il) j(x) 


d u>k dr? dp ui\ sin He 2 (E, — Ef) 2 


u k 


|ejfc, 2 | 2 sin 2 (c^jt -E\ + £ f ) . (10) 
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The diagram is useful for a better understanding of (10). With 


f 


dt? sin 3 $ = 


x cos 3 $ — cos 7? 


4 

3 


(10) turns out to be 
( transitions \ 4e 2 


J - — XI ( Ei ~ E( ^ [ d3x 4>f i (x)xipi(x) 
' PJ ^ 


V time / “ ^ ^> \ I u A n w^iv-vi , (11) 

and we also normalize the final electron states in a box with volume £' 3 (£ = £ f ) : 

W = \f77^ u > r (P’ s ) Q ~ iP ' 


These wave functions are normalized to 1 : 

f d?x </•;'«</>,''(*) = -, % f d 3 ^-4„, = 6 , 
J lE £ £ Ju'i mo 


( 12 ) 


(13) 


similarly to the wave function for the hydrogen ground state (we choose the case 
with spin up): 


/ 


V’i 


(2moa) 3 / 2 [ 

~V4^~V2r( 1 + 2 7 ) 


+ 7 


(2moar)' y ~ l e~ m ° ar 


1 

0 

i(l - 7) 


\ 


COST? 


a 

i(l - 7) • q iv , 
sinve^ j 


a 


7 = \/ 1 — a 2 , e 2 = 47ra . 

a is the fine-structure constant, a « 1/137. With [cf. (6.30)] 

^ )= /g±j o(Pz_ P- ! Q 

V 2mo (2mo’E+mo’ ’ 


(14) 


w 4t (p,j) = 


_ E + mo ( p+ -p z 


2 mo \E + m 0 ’ £ + m 0 

one obtains, with the approximations 

1-7 l-(l-a 2 /2) a 

— — « i L = - <C 1 

a a 2 

and the abbreviation 


, 0,1 


( 15 ) 


(2 motif I 2 / 1 + 7 , 

a(r) = - —= — w — (2 mnar) 7 ~ 1 e ~ m ° ar 

V 2r(i +1 )^ m ° ar) e 


( 16 ) 
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E / d * x ^ 

’ •{!/// 


(x)a:^i(x) 


1 


m 0 


2mo(E + mo) L S E 


r dr ch9 dip sin i9xa(r) e 1 


i|p| \x\ cos 


(p? + \p + I 2 ) 


2E(E + m 0 )Z/ 3 

+ 


sin i9r cos 'da(r) e lpr cos ,3 


,i/?r cos $ 


JJJ r 2 drd'ddp sintfr sin$siny?a(r)e' 

+ JJJ r 2 dr dd dip sin i?r sin i? cos <^a(r) e lpr cos 15 

Performing the p integration yields 

f |2 

^ / d 3 * ^ + (jr)£ci/»i(:r) 

r ^ 

/? 2 4tt 2 


POO /»7T 

dr di? r 2 a(r) sin D cos $ e ipr cos 13 

JO JO 


2£(£ + m 0 )L' 3 

and with 7 « 1 it follows that 

3 , x (2 m 0 a) 2/2 3 _ 

r a(r)K — — _ r e ° Qr . 

x/4tt^ 

Insertion of (11) results in 

/ transitions^ _ ^nm^a 2 ^ P 2 (^i + £) 3 
V time ) 3 2^ Z/ 3 £(£ + m 0 ) 


p 

POO PIT 

/ dr / 

7o 


dtf r 3 e~ m ° ar sin 1 ? cos 1 ? e ipr cos 1 


with e 2 = 4ira, and now the t? integration is performed: 


(17) 


(18) 


[ di? cos -d sin 1 ? e lpr cos = — cos e ipr cos — f dd sin {) c ,pr cos{> 

Jo * ' W 0 W Jo 

U y' 


— (e ipr + e -ipr ) - 
ipr v J p 2 r 2 


Jipr cos $ 


= — (e^ + e-^) + -T, (e^ - e"^) . (19) 
ipr v 7 p 2 r 2 v ’ 


With 


f 


r" e -0 " dr = 


w! 

t/i + 1 


Example 12.1. 


it follows that 
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poo p7T 

/ ^ 

Jo Jo 


dd r 3 e~ m ° ar sin cos i9 e ipr cos ’ 


1 




ip \(m 0 a-ip) 3 ' (m^a + ip) 3 
1 / 1 1 


p z \(moa - ip) 2 ( moot + ip ) 2 


i 4mla i - l2moap 2 1 i4moa 


+ 


P {m^a 2 + p 2 ) P (m^a 2 + p 2 )* 

i I6m 0 ap 2 . l6moap 


= l- 


P {m^a 2 + p 2 Y ( m^a 2 + p 2 ) 3 
In (18) we replace 1/Z/ 3 by J d 3 pi/ (2 tt) 3 , which yields 


/ transitions^ 

\ time J 
4x8x7 r 2 


16 2 m^a 6 j dV^“ 


(£, - £) 3 


(2tt) (m^c^ + p^ E{E + mo) 


\/3mo 


( £ i + ^/p 2 + m o ) 


16 3 f 

= —mla 6 / dp- , 

0 (/Wo<* 2 +p 2 ) yp 2 + n»o (\/p 2 + m o + m o) 

V5 


16 3 6 f 

= -ymoa J 


dx 


(Ei/mo + Vx 2 + 1 ) 


3 


(a 2 + a : 2 ) 6 V* 2 + 1 (V* 2 + 1 + 1) 


( 20 ) 


( 21 ) 


The hydrogen ground-state energy is approximately E\ = mo—a 2 mo/2. The integral 
(21) is well defined and yields a finite value, for which we want to estimate the 
lower bound: 


x 6 {Ei/mo + Vx 2 + 1 ) 3 ^ x 6 (1 + l) 3 

{a 2 +x 2 ) 6 y* 2 + 1 (Vx 2 4- 1 + l) > J (« 2 + 3) 6 

o o 


V : 

/ 


dx- 


2-3 


2 3 3 3 \/3 _ 2 2 \/3 / transitions^ 

\ time J 


3 6 x 2 x 3 7 3 4 x 7 

16 0.51 MeV 

137 6 6.5 x 10~ 22 MeV s ~ 


> 16moa 6 


10 9 - 


( 22 > 


If the lower (negative) continuum were empty, then the hydrogen atom would decay 
in a very short time (r < 10“ 9 s). In order to ensure the stability of the hydrogen 
atom (and of all other atoms) one has to regard the negative energy continuum as 
fully occupied with electrons, so that the Pauli principle blocks those states from 
being available for radiative transitions. 
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The hole theory leads us also to a new, fundamental symmetry : For the electrons 
there exist antiparticles, the positrons. In general it is true that for every particle 
in nature there exists an antiparticle. Now we have to formulate this symmetry 
mathematically in a rigorous way. Thereby we shall see how the wave functions 
for the positrons follow from the wave functions of the electrons with negative 
energy, and vice versa. 

According to the hole theory a positron is a hole in the filled “sea of electrons 
with negative energy”. According to (12.4-12.7) this positron has the same mass 
as the electron, positive energy (£p OS itron = -^electron of neg . energy), and opposite 
momentum and charge as the electron with negative energy. There exists a one- 
to-one relation between solutions of the Dirac equation for electrons with negative 
energy, 

(ifiy- -4 - m 0 c) = 0 , ( 12 . 11 ) 

and the positron eigenfunctions. 

The positron wave function f> c has to fulfill the equation 

(ifty+ e -4 - m 0 c) ri> c = 0 , (12.12) 

because positrons should have all the properties of positively charged electrons. 
Note that the positron wave function *0 C should be a solution of (12.12) with 
positive energy ! 


Remark. If is of no consequences which one of (12.1 1) or (12.12) is labelled as the 
first (particle). Historically (12.11) was the starting point for electrons, but (12.12) 
could be considered as the particle equation, as well. The sign of the charge e of the 
particles described in the initial equation does not matter either: If we had chosen 
( 12 . 12 ) to start with, we would have obtained a spectrum of solutions for the free 
positrons like that known for electrons from (12.11) (see Fig. 12.4). The negative 
energy states would then have been filled with positrons (•) in the framework of 
the hole theory, while electrons would then have been positron holes with wave 
functions given by the negative energy solutions of (12.12). It is now our aim to 
find an operator which connects the solutions - 0 C of ( 12 . 12 ) with -0 of ( 12 . 11 ). 
This operator must change the relative sign of iftyand {e/c)Jfi. This is simply done 
by complex conjugation: 


i h 


dx* 1 


= — i h 


d_ 
dx » 


A* — A 


(12.13) 



The electromagnetic four-potential A ^ is always real; thus we take the complex 
conjugate of ( 12 . 11 ) and after the conversion of all signs we arrive at 



+ % 
c 


7 ^* + m 0 c 


=0 


(12.14) 


Now we wish to find a non-singular matrix 0 = C 7 0 in such a way that 


Fig. 12.4. The vacuum of the 
hole theory based on posi- 
trons as initial particles. Here 
a positron hole is an elec- 
tron. As marked by the full 
circles (•) the states of neg- 
ative energy are occupied by 
positrons 
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UY*U~ X = - 7 m 


(12.15) 


holds. Before doing so, however, let us first proceed as if U is already known. 
After multiplication with U (12.14) transforms into 


j k ® i £. a \ — 1 

dx» + c 


+ -_ A n ) +moC 


Uip* =0 


or, with equation (12.15), 


m -L + ‘c A » 1 7 '‘ - 


/n 0 c 


U$* = 0 or 


\ny+-4- 

L C 


moc 


Uxp*= 0 . (12.16) 

For the positron wave function a comparison of (12.16) with (12.12) yields 

i/> c = 0i>* = C 7 V = c? T . 

The superscript “T” indicates “transposition” and means 

i[) T = (V’ t 7°) T = 7 0T V’t T = 7° (^*T) T = 7V* , (12.17) 

because of 7 0T = 7 0 and ^ * T . To determine U explicitly we rewrite (12.15) 


as 


C 7 V*(C7°) 1 =^7V*7 0 ^ _1 =~7 m • (12.18) 

Now, according to our explicit representation (3.13) for the 7 matrices, i.e. 

1 O' 


7 


0 a‘ 
0 


7° = 


0 -n 


the identity 

,y0,y***,y0 _ 

can easily be derived. Thus for (12.18) we get 


(12.19) 


( 12 . 20 ) 


(C- 1 ) V(C) T = -7^ T , 


( 12 . 21 ) 


and 


JT 


2T 


7“ = ~7‘ > T‘ =T , 7 3T = -7 3 , 7 U1 =7° , (12.22) 

(C) T must commute with 7 1 and 7 3 and anticommute with 7 2 and 7 0 . Therefore 
C = i7 2 7° (12.23) 

is a useful choice for the operator C, and it holds that 


OT 


C = i 7 2 7° = -C 


i-l 


-ct = -C 1 


(12.24) 


This clearly illustrates the non-singularity of C (the inverse matrix is explicitly 
constructed). Of course (12.23) is valid in the special representation (12.19), but 
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C can easily be given in every other representation via a unitary transformation. 
The phase of the operator C is rather arbitrarily fixed by the factor i in (12.24). 
Indeed, the choice of the phase does not influence at all our current investigations. 
Thus the charge-conjugate state of ip(x) is given by 

ipc = tip = C^kxl) = C 7 V - CtA T = i 7 V*W , (12.25) 

where K is the operator of complex conjugation. According to (12.12) the wave 
equation for ip c (x) differs from the wave equation for ip(r) (12.11) just by the sign 
of the charge. Thus it follows: If ip(x) describes the motion of a Dirac particle 
with mass mo and charge e in a potential A fl (x ), then ip c (x) represents the motion 
of a Dirac particle with the same mass mo and opposite charge ( c) in the same 
potential A^(x). The spinors ip and ip c are charge conjugate to each other. With 
the relations (12.20), (12.25) and 7 2T = 7 2 , it holds that 

(V’c)c = (i 7 V"(-«)) c = 17 2 {i 7 v*(*))* = 7 2 7 2 *^{x) = 7 2 7 o 7 2T yv(*) 

= 7 2 7°7 2 7°V’(a:) = -tVtWW = ipix) , (12.26) 

that is 


(V’c)c = Ip 


(12.27) 


Thus the correspondence between ip and ip c is reciprocal. Besides, we draw the 
following interesting conclusion about the expectation values of operators. Let 

(Q) = {fp\Q\rp) (12.28) 


be the expectation value of an operator Q in the state ip. Then the expectation 
value of the same operator Q in the charge-conjugate state ip c is given by 


(Q)c = (V'clQIV’c) = j iplQip c d 3 x = J (i~f 2 ip*y Qi^ip* d 3 

= J ip*^ 2 ^ Qj 2 ip* dh = y d 3 x 


= - J V , * t 7 2 7 ° 7 0 < 27 2 V’* d 3 x — - J ip*^ 2 Q^ 2 ip* d 3 * 

= - {i 2 qi 2 Y^p^x\ = -(^i7 2 *eV*i^)* 


= -(i , \i 2 Q* , y 2 \^)* 


(12.29) 


In this manner one easily proves the following relations (cf. Exercise 12.2): 


(a) 

<0>c = 

-(/?) , 


(b) 

(x) c = 

(x) , 


(c) 

(®i)c = 

(7°7'')c = 

= <“») . 

(d) 

(P) c = 

-<P> , 


(e) 

II 

1p ] 1p , 


(f) 

1pt&1p c 

= ip^aip 

j 

(g) 

<^)c = 

-<£> , 


(h) 

(L)c = 

~(L) , 

L = r x p , 

(i) 

<->>c = 

~(J) , 

J = r xp+\E 


(12.30) 
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Because the Hamiltonians of both Dirac equations (12.14) and (12.12) are given 
by 

H(e) = ca l (pi + + eA o + fimoc 2 

= col • (p - + eA 0 + Pntoc 2 (12.31a) 

or 

H(—e) = ca' (pi — -A / j — eAo 4- j3moc 2 

= ca • (p + -A^ — eAo + f3moc 2 , (12.31b) 

one also deduces with the help of (12.29) the relation (cf. Example 12.3): 

{H(-e)) c = ~{H(e)) . (12.32) 

The results of (12.30) and (12.31) are quite interesting. Accordingly the charge- 
conjugate solutions 'i/; c have the same probability density and probability current 
density in all space-time points (12.30e,f). Therefore the electric charge density 
and the electric current density for ip and ip c are contragredient. Equations (12.30) 
and (12.32) express the important result that a charge-conjugated state ip c has the 
opposite momentum and energy to the state ip. The relations of the hole theory 
expressed in (12.4) and (12.7) read in their most precise form: charge conjugation 
changes the sign of the momentum and energy. 

exercise 

12.2 Expectation Values of Some Operators in Charge-Conjugate States 
Problem. Prove the relations (12.30). 

Solution. According to (12.9) the expectation value of an operator Q with regard 
to a charge-conjugate state ip c is given by 

(Q) c = (V'clQlV'c) = - J f)\x)y 2 Q*y 2 ip{x)<£x 

= -(V’It 2 !2*7 2 IV>)* • (i) 

From this we conclude in particular: 

(a) Q = $ = 7°: 

0}c = -(V’|7 2 7°7 2 |V’}* = (V'|7 2 7 2 7 ° \^Y = -(V'|7°l^)* = 
and therefore 

0)c = ~0) ■ 

(b) Q = &i = 7 ° 7 ' : first case ( i = 2): 

(&2)c = -{V , |7 2 7°(-7 2 )7 2 |V’)* = tyYlWW) = <«2> 


( 2 ) 
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Second case (i = 1, 3): 

(<*f)c = — <V , |t 2 T°(+t' )T 2 |^)* = -<V’|7°7 2 7 2 7 , |V')* 
= -(V’|7 , 7°IV’>* = +(ip\'l 0 l , \'4>) 

= (&') , 

and therefore 

<«i)c = (&i) i = 1,2,3 . 

(c) 0 = £: 

r r 

V’^(x)7 2 *7 2 V’(a;) d 3 * | 

= (V'l&IV')* = (V> l&lt/’) 

and therefore 

<*>c = <a> • 

(d) G = p = (V i)V: 


(3) 


(&>c = - 




(4) 



— - V ) j 'ip(x)d*x 


(P) C = - 

= -(V’IpIV’)* = -(V’IpIV’} , 

and therefore 

(P)c = -(p) ■ 

(e) Because is a real number, the following holds: 

'ipl'ip c = (^*^7 2 ^(— i)) (i7 lf ip*) = — '0*^7 2 7 2 ^* = 

= '0 T '0* = 

From this we conclude that 
■ 

(f) If we apply the identity 

ipl&iip c — '0*^7 2 ^o:/7 2 '0* = — '0 T 7 2 7°7^7 2 '0* 

to the cases i = 2 

2022 20 02 * aT 

7 7 7 7 ~ — 7 7 ~ 7 7 — o^i — ~ol 2 

and i = 1, 3 

7 2 7°7*7 2 — 7 2 7 2 7°7* = — a;/ = — ctj , 

we get the relation 


j V> f (®)y 


V^(a:)d 3 j: 


(5) 


( 6 ) 


Ipl&jlpc = 


Exercise 12.2. 
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because the expression in parentheses is an ordinary number; hence (compare with 
b): 


'tpl&'tpc = i 

(g) Q = £ = 1 & ° 


o a j 4 

(£i)c = ~( , 4>h 2 £*'y 2 \ip)* , 


with 

7 2 £* y2= ( o & 2 \fa; o\ / o a 2 
7 ' 7 \ a 2 o o a*)\-& 2 o 

o a 2 a* Wo & 2 
-& 2 aj o ) \ -& 2 o 

We get for i — 2 

-0 2 &l& 2 = 0 2 U 2 0 2 = &2 = = <T 2 * T 

and i = 1, 3 


d 2 d* d 2 0 
0 a 2 a* a 2 


-a 2 a t a 2 = -o 2 Oi<j 2 = o 2 a 2 Oj = <r, = a t 
and therefore 


*T 


(^•)c — \ ^ 


7* J 0 

0 <7* T 


= -W£i\4’) , 


i.e. 


(£i) e = -(£,) ■ 

(h) Q = L = x x p = x x h/ iV: 


(L) c = 


J r/>^(x)7 2 


ft. 


r/> T (x) 7 ( * x yV ) 7 r/;(x)d J x 


= — J x -rv) ip(x)d i x — —(il>\L 


( 7 ) 


( 8 ) 




i.e. 


(t>, = -{£) 

A A 

(i) Q — J- 

(J) Z = {J) C + \{E) c = 


( 9 ) 


-<!,> - i(l7) , 


i.e. 


(•>>0 = -(J) 


( 10 ) 
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EXERCISE 

12.3 Proof of {H(—e)) c = —(H(e)) 


Problem. Prove the relation (H(—e)) c = —(H(e)) and interpret the result. 


Solution. From (12.31a) and (12.31b) we have 

H(e) = ca • (p - ^-A'j +eA 0 + (3m 0 c 2 , (1) 

H(—e) = ca-(p + ^A?J—eAo + j3moc 2 . (2) 

Using the relations derived in Exercise 12.2 we find 


(a * P) c = 


J xj)\x) (— 7 2 a*7 2 ) i/>(cc) d 3 x 


In Exercise (12.2b) we derived the relation — 7 2 ct 2 7 2 = 7*7°^ = cU. Therefore 

(a • p ) c = -(V'|a t • p\ip) = ~(ip\p • a t |V’) = • pIV’) , 


(a-p)c = -(& -p) . 

(3) 

In addition, 


o 

II 

1 

(4) 

(ea • A) c = (ea. • A) , 

(5) 

(eA 0 ) c = (M 0 ) . 

(6) 


From (1) to (6) follows 

(H(-e)) c = -c(a • p) + e(a • A) - (eA 0 ) - (/3/noC 2 ) = ~(H(e)) , 


i.e. 


(H(-e)) c = -{H(e)) . 

This result means that the negative energy solutions of the Dirac equation cor- 
respond to the charge-conjugate solutions of positive energy (and vice versa). In 
Exercise 12.2 we derived that these solutions have opposite spin and momentum, 
properties which allow us to interpret the solutions of negative energy (charge e, 
spin s, momentum p) as wave functions of particles with positive energy (charge 
—e, spin —s and momentum — p). Compare this with the discussion following 
somewhat later on charge conjugation of bound states! 


306 


12. The Hole Theory 


EXERCISE 


12.4 Effect of Charge Conjugation on an Electron with Negative Energy 

Problem. Examine in detail the influence of the transformation 

Tp c = C ij? = i'y 2 -i))* 

on the eigenfunctions of an electron at rest with negative energy. 

Solution. A (free) electron at rest is described by the wave function (6.1). For an 
electron with negative energy and spin down (|) we have 


V > 4 = 


1 


\/ 2 nh 


/°\ 

0 

0 

Vi/ 


j+ifmoc 2 / h)t 


( 1 ) 


The corresponding solution with positive energy and spin up reads 

1 



y/ 2 nh 



0 

0 

0 


f°\ 


0 

0 

i 

0 

0 


0 

i 

0 

° 

0 

\ 

-i 

0 

0 

0/ 

\ 1 / 


e“ 

-i(moc 

2 /h)t 




+i(moc /h)t 


\Z2nh 


( 2 ) 


\0, 


Similarly, for an electron with negative energy and spin up (|) 


ii? = 


V2nh 


i 

Vo/ 


a +i (m 0 c 2 /h)t 


(3) 


we get the positron wavefunction for positive energy and spin down (j) 

\ 

1 


(^ 3 ) c = 17 V* = i 


/ 0 0 0 -i 
0 0 i 0 
0 i 0 0 

V-i 0 0 0 



a +i (m 0 c 2 /h)t 


= (-D 


/ox 

0 

Vo/ 


1 


y/ 2 n h 


-i (moc 2 /h)t 


(4) 


Here an inessential phase factor (-1) appears. This example demonstrates explicitly 
the influence of charge conjugation: The absence of an electron at rest with negative 
energy and spin | (j) is equivalent to the presence of a positron at rest with 
positive energy and spin j (|). If there are no fields present, there is no difference 
between electron and positron: From (2) and (4) one sees that the transformation 
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of charge conjugates leads back to other electron solutions in the field-free case. 
This example makes the rather strange re-definition of the spinors u, 3 and u 4 [see 
( 6 . 56 )] conceivable: An electron of negative energy with spin | (f) corresponds 
to a positron with spin | (|). The t>(x,.s) spinors of ( 6 . 56 ) describe positrons with 
spin projection s. 


EXERCISE 

12.5 Representation of Operators for Charge Conjugation and Time Reversal 

Let 7^ and 7^ be two representations of the 7 matrices connected by a unitary 


transformation 

^ = U'f'u~ x . (1) 

Problem, (a) Show that 

C' = U~ X C (U T )~ l , (2) 

where C and C' are the respective matrices for the transformation of charge con- 
jugation 

(b) Are the relations 

C = -C -1 = -C f = -C T = i7 2 7° ( 3 ) 

also valid for C f ? 

(c) Analogously to (a), show that the general expression of the time inversion 
operator reads 

Tq = u~ l f 0 ( U T ) 1 , where t 0 = ryV . ( 4 ) 

Solution, (a) If solves the Dirac equation 

(ih^d^ - ^7^ - moc'j ip = 0 , (5a) 

the charge conjugated equation is 

+ ^7^ - m 0 c) ip c = 0 . (5b) 

Inserting the transformation (1) into ( 5 a), one deduces the transformation for the 
wave functions as (7^ = U~ 1 J»U) 

iP' = U- x iP , (6) 

and analogously 

i>' = u- l ip c 


Exercise 12.4. 


( 7 ) 
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Exercise 12.5. 


The charge conjugated wave function follows from 

= cv> T . 

(8) 

With (7), (8) and the relation 7 °* = 7 ° = 7 0T 

ijl = (■0^ 7 °) t = 7 °v = 

(9) 

we obtain 

xl>' c = U- x C^ = U- l C^°r . 

(10) 

Because of 

<//* = (U~ x )*ip* and = 

it is valid that 

= £/ _1 C 7 °f/V* . 

(11) 

Further we obtain 

7 0 '* = (c/— *)* 7 °*t/* = (t)- 1 )*/#* 

= f/ T 7 °(t/ _1 ) T = (U- X J°U) T = 7 °' T . 

(12) 

Now we use the unitarity of the transformation 0 

u- x = u^ = (u T Y , 

(13) 

so that (11) leads to 

$ = t/ _1 C 7 °£>V* 

= U~ X C ( U*U*~ X ) 7 °t/ V* 

= U~ X CU* (U*- x y°U*) ip'* 

= t/ _1 cf/* 7 °'V* 

= u- x cu*(ip'^°') T = u~ x cu*ip >T . 

(14) 

Comparing this equation with the definition of C' 
V>' = c'ip ,T 

(15) 

we immediately get 

(16) 

(b) We analyse the various relations (3) one after another 

c ,T = t/ _1 c (i/ T ) _1 T = f/ _1 c T 
= -t/ _1 c (iy T ) -1 = -c' . 

(17) 


This relation is thus conserved. For arbitrary unitary matrices U , with 0 — £/*, 
we have on the other hand: 
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c't = [^- , e(^ T ) _1 ] t = [(^ T ) _, ] t c t (^- i ) t 

= U T C^U = -U T CU ± c' , 

(18) 

c'* = 0 T C*f) = U T CU + C' (C is real) , 

09) 

= u J c-'u = -U T CU = C'^-C' , 

(20) 

i7 0< 7o = iU- l 7 2 UU~ 1 7 °U = if/'^Vt) 

= t/ _1 c0 ^ t/ _, C (t/ T ) * = C' . 

(21) 


Remark. For real matrices U(U = U*) all relations are still conserved. 

(c) The operator of time inversion is defined by the equation 

M') = f 0 r(t) , ( 22 ) 

where 'iprit') is the time inverse transformed spinor with t' = —t. We will proceed 
analogously to (a) and obtain 

= = = = , (23) 

i.e. 

t'=U-'T 0 (U T )-' , (24) 


12.2 Charge Conjugation of Eigenstates with Arbitrary Spin 
and Momentum 


Let \jj{x) be an arbitrary plane wave with momentum p. We know how by use of the 
projection operators [see (7.8), (7.21)] to constr uct from ip (x) plane waves of the 
four-momentum p M = {ep°,p} with p° — +c^/p 2 + moc 2 and spin s ^ = {j°, s}, 
namely 


'tpeps (.X ) — 


e// + m 0 c 
2moC 


1 +75^ 
2 


4>(x) 


(12.33) 


These plane waves have positive or negative energy for e = ± 1 , respectively, and 
spin in the s direction. Because of 

[C\7 5 ]_ = [i7 2 7°.i7V7 2 7 3 ]_ =° (12.34) 


75 = (i7°7'7 2 7 3 ) T 

_ • 3T_2T_1T_0T 
= 17 7 7 7 

= i (~7 3 ) (l 2 ) (-7 1 ) 7° = i7°7’7 2 7 3 = 75 

i - ,0 -.3* 0 _ .0 _ .2* -.0 0 1 * -.0 0 0* 0 

1 /V 'V / y 'y 'y ^y 


Exercise 12.5. 


and 


310 


12. The Hole Theory 


[using (12.20)] 


= i7°7 3 *7 2 *7 U 7°*7 0 = -i 7 3 * 7 2 V*7°* 
= -i 7 0 V*7 2 V* = 7s , 


after charge conjugation (12.33) yields the state 
(' l Peps)c — Clp tps = C 7 °'0 epJ 



From (12.18) we get 

C 7 V*(C7 0 ) _, = -7" , 

and, further, 


(12.35) 


(12.36) 


p n* = +p n 


(12.37) 


because only the momentum (a real number), not the operator of the momentum, 
appears in the projector [see (7.8)]. Therefore from (12.36), with the aid of (3.1) 
and with 7 ° 7 5 7 ° = — 7 s, we get 


(’tpeps )c 


-epf + mpc 
ImQC 

-tfj + tn 0 c 
2moc 


1 + 75 / 
2 

1 + 75^ 
2 


cyv* 


4>c 


(12.38) 


The charge-conjugated solution of f cps (12.33) thus has the same polarization s t , 
but opposite energy and momentum (—tp 11 ) as the original solution. The latter fact 
is expressed by — ep — —c{p°,p}. The inversion of momentum and energy was 
expected, but (naively) so was the inversion of the polarization s p . To understand 
why the polarization is not changed under charge conjugation, let us remember 
(6.56), where the spin projection (polarization) of electrons with negative energy is 
indeed defined with inverted sign. The same is also expressed in (7.19) and (7.20). 


12.3 Charge Conjugation of Bound States 

We now consider an electron in an attractive Coulomb potential: 

Ze 2 

eA 0 (r) - — — . (12.39) 

An electron state of negative energy corresponds via charge conjugation to a 
state of positive energy of a particle with the same mass mo but opposite charge 
(i.e. a positron ) in the same potential or - which amounts to the same - a state of 
an electron in a repulsive potential. Due to this correspondence the energy changes 
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+moc 2 ■ 


-moc 2 - 


(a) (b) (c) 

its sign, and, because of the 7 2 matrix in the operator C 7 0 = i 7 2 , the large (upper) 
and small (lower) components are exchanged under charge conjugation, but the 
densities and current densities remain the same. 

The spectrum of an electron in a repulsive potential is shown in Fig. 12.5c. It 
shows a series of bound states (the same number as for an electron in an attractive 
potential - case b) which lie near the lower continuum. Switching on the repulsive 
potential adiabatically, these bound positron states emerge from the lower contin- 
uum ; analogously to the bound electron states, which are pulled down from the 
upper continuum while switching on the attractive potential (Fig. 12.5b). The posi- 
tive energy continuum in the repulsive potential corresponds exactly to the negative 
energy continuum in the attractive potential. This has important consequences for 
the construction of the vacuum 2 in the hole and field theory. Having defined earlier 
the vacuum state as a Dirac sea of negative energy states filled up with electrons, 
we can now, because of the equality of electrons and positrons (charge-conjugation 
symmetry), consider the vacuum as a “symmetrized sea of electrons and positrons ” 
(see Fig. 12.6a below). A one-electron state is visualized in Fig. 12.6b. In this exam- 
ple the electron occupies the second strongest bound level. It is not distinguishable 
from a corresponding positron hole in its second strongest bound level (emerged 
out of the positron sea of negative energy into the gap —m$c 2 < E < m^c 2 ). Both 
configurates exist equally beside each other. If they were distinguishable, then 
no particle-antiparticle symmetry (charge-conjugation symmetry) would exist. The 
existence of this symmetry allows for the suppression of one half of the figures 
(usually the positron side) in Figs. 12.6a and b. 

We note that the vacuum state defined in this symmetrized form obviously has 
total charge zero, but infinitely large energy. The latter must be renormalized to 
zero, a subject that is tackled in quantum electrodynamics. 

From earlier studies (Exercises and Examples 9.6 to 9.9), we know that in this 
case the spectrum consists of the positive energy continuum m$c 2 < E < 00 , of a 
number (more precisely an infinite number) of discrete bound energy levels with 
—moc 2 < E < m$c 2 , and of the negative energy continuum —00 < E < —m^c 2 . 
The bound states constitute the discrete energy levels in the energy gap between 
—moc 2 and +moc 2 (see Fig. 12.5). The states of negative energy can be obtained 
from the states of positive energy by charge conjugation, and vice versa. One 
should understand the following point very clearly. 


Fig. 12.5. Energy spectrum 
of a Dirac electron in an at- 
tractive Coulomb potential, 
(a) free electron without po- 
tential, (b) electron in an 
attractive potential - Ze 2 /r , 
(c) electron in a repulsive po- 
tential -fZe 2 /r 


moc 2 


TTiO c 2 


-X- 

-X- 

-X- 

-X 


-X- 

X- 

-X- 


(a) 


moc 2 . 


— 7 71QC 2 - »<■■*- 
-X - K - 
X X 
XX- 
— X— X 


o- 


(b) 


Fig. 12.6. (a) Illustration of 
the “symmetrized” vacuum 
state, consisting of states of 
negative energy filled with 
electrons (x) and positrons 
(•). (b) Illustration of a state 
with an electron in the sec- 
ond strongest bound level, 
which is indistinguishable 
from a positron hole in the 
second strongest bound pos- 
itron level 


2 Refer also to W. Greiner, J. Reinhardt: Quantum Electrodynamics , 2nd ed. (Springer, 
Berlin, Heidelberg 1994). 
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Historical comment. It is amusing to read papers on the quantum theory of 
radiation. 3 Trials were made, e.g., to identify the electron hole as a proton. A num- 
ber of arguments were given to justify that the mass of a hole should be greater 
than the mass of the electron, even though they were predicted by the hole theory 
to be equal. On the other hand, it should be possible, due to the hole theory, that an 
electron and a hole annihilate, with the emission of two photons. This probability 
was calculated by R. Oppenheimer, P.A.M. Dirac and I. Tamm 4 with the result that 
matter and antimatter annihilate each other in a very short time. However, when 
the positron was discovered, that which had previously caused the most serious 
difficulties became the greatest triumph of the theory. Indeed, the prediction of the 
existence of the antiparticle must be considered as one of the greatest successes of 
theoretical physics. 


12.4 Time Reversal and PCT Symmetry 

In this section we want to investigate the time-reversal transformation ( T transfor- 
mation) and show its connection with parity transformation and charge conjugation. 
Similarly to space reflection (parity transformation), time reflection is an improper 
Lorentz transformation. Yet another symmetry is the gauge invariance of the Dirac 
field ip, which interacts with the electromagnetic fields A^, and which is - as we 
know - ensured by the minimal coupling p^ — (e/c)A fl . However, we do not want 
to elaborate further on this point, because for the present discussion it will be ir- 
relevant. Later on, however, (when considering the problems of renormalization in 
quantum electrodynamics) the gauge invariance will be very important. 

From (4.9) we already know the parity transformation or spatial reflection, 
which is represented by 


Pip(x, t ) = ip'( x' , t) — Tp'(-x, t) — e lv 7 °ip(-x, t) , (12.40) 


with 

x' = -x . 

The spinor ip'(x',t) is usually referred to as the spatially reflected spinor or spa- 
tially reflected wave function. In the case of plane waves the momentum, but not 
the spin is reversed under spatial reflection; exactly as one would expect for clas- 
sical quantities. The parity transformation has the following effect on the various 
operators: 

PxP~ ] = x' = -x , (12.41a) 

Px 0 P ~ 1 = Xq = x 0 , (12.41b) 


3 For example, we recommend Fermi’s paper: Rev. Mod. Phys. 4, 87 (1932), which was 
written between the formulation of the Dirac equation in 1928 and C.D. Anderson’s 
discovery of the positron in 1933. 

4 J.R. Oppenheimer: Phys. Rev. 35, 939 (1930); P.A.M. Dirac: Proc. Cambr. Phil. Soc. 26, 
361 (1930); I. Tamm: Zeitschr. fur Physik 62, 7 (1930). 



12.4 Time Reversal and PCT Symmetry 


313 


PpP- l =p' = -p , (12.41c) 

Pp 0 p- l =fc=p 0 , (12.41d) 

PA 0 (x,t)P-' = A(,(x',f) = A 0 (x,t) , (12.41e) 

P A(x, t)P~ x = A\x /) = -A(®, I) , (12.41f) 

where 

x' = -x . 


The first four relations are immediately understandable, while the last two (12.41e,f) 
denote the scalar and vectorial nature of the potential Ao(x,t) and vector potential 
a(x, t), repsectively. Applying the same arguments leading to (3.30), but now with 
the special parity transformation (12.40) yields 

ip - ~ c 4- - m ocj Tp(x, t) = 0 

(m°+M' - - 07' - m oc) P~ x ip'{x', 0 = 0 

and after multiplying with P from the left 

P (po 7° +Pii ~ -^ 07 ° - -Aj(x,t)y‘ - moc) P~ l ^\x',t) = 0 

(po7° + (+pi)(~ 7 ') - “^07° - + A/(*,0)(-7') - m ocj ip'(x',t) = 0 

(po7° +Ph i - ^ A o( x 'i 07° - ^(®', 07' ~ 0 = 0 

[//' - e -fi(x! , t) - moc 4>’(x', 0 — 0 . (12.42) 

Since under the parity transformation (4.1, 4.2) 

Po = Po , t' = t , 

p- = - Pi , x' = -x , 

Ao(x, t) = Aq(x', t') , A(x, t ) = -A’(x', t') 

one finally arrives at 

{p' ~ e ~$ ~ mQpj ip'(x r , t') — 0 . 

This means: The parity-transformed wave function \P'(x',t) = P\P(x,t ) = 
!?'(— x,0 [see (12.40)] obeys the same Dirac equation as the original wave func- 
tion P(x. t ). We say: The parity transformation P leaves the Dirac equation and 
all physical observables unchanged. 

How does the parity invariance express itself physically? One can express this 
most simply by a sequence of observations of a state described by a wave function 
\P(x,t): These observations are registered on a film, but the pictures are taken via 
a mirror. In other words: The camera films the observation of a state ^(x, t) in a 
plane mirror (see Fig. 12.7). We then call the dynamics described by a state ^(x, t) 
parity invariant, if the events registered on the film as reflected images can also 
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mirror 




picture of state 


Fig. 12.7a, b. Illustration of 
parity. The state is 

represented by a Dreibein 
(three unit vectors of a 
Cartesian coordinate sys- 
tem), which is filmed via 
the mirror. On the film the 
z axis appears reversed (be- 
cause of reflection). If the 
camera stood directly beside 
(in front of or under) the 
state, one would get the pic- 
ture shown in (b), with op- 
posite z axis 


be possible direct observations of the state 'P(x,t)\ i.e. observations of the state 
without the mirror. From the observation of the events in the mirror image of the 
film, we must not be able to say whether we are looking at the picture of the mirror 
image. Both must be equally possible in parity-invariant dynamics. 

We remark that a mirror image is not identical with full spatial reflection, since 
- as Fig. 12.7b clearly shows - in the mirror only the z axis is reversed (i.e. the 
axis _L to the mirror plane). Around the latter a rotation of n must be performed 
to obtain a full spatial reflection. However, such a rotation is a proper Lorentz 
transformation. The reflection described is thus a spatial inversion plus a proper 
Lorentz transformation (rotation). Since the theory (dynamics) is invariant under 
proper Lorentz transformations, the “mirror movie” yields exactly the information 
about parity invariance that we require. 

We now move on to discuss time reflection and time -reflection invariance. 
The physical sense is also explainable with the film example. This time the film 
registers the observations of the state &(x,t) just in its time sequence: we do not 
need the mirror, instead we run the film backwards. One calls the dynamics time- 
reversal invariant if the observations on the movie running backward could have 
happened the same way on the forward-running movie. It must not be possible to 
determine by watching the filmed events, whether the movie is running forwards 
or backwards. In other words: Both the observations on the forward-running and 
backward-running movies must be realizable in the state \P(x,t). 

In our case of the Dirac theory the dynamics will be time-reversal invariant if, 
by performing the transformation 

*' = -f , x f = x , (12.43) 


the form of the Dirac equation remains unchanged; the interpretation must not be 
changed, either (cf. Exercise 12.6). Then, the transformed wave function 

*') = tMx, t) = ^(®, -0 (12.44) 


describes a Dirac particle which propagates backwards in time. This is physically 
possible if lP^(®, /') also satisfies the Dirac equation. Equation (12.44) is a special 
case of the general definition (3.27) and the time inversion (12.43) is a special 
improper Lorentz transformation. T is an operator which acts on the spinor com- 
ponents, but not on space and time coordinates, a fact that should also be made 
evident in connection with the general scheme (3.27). 

Let us now construct the time-inversion transformation explicitly. Therefore we 
rewrite the Dirac equation in Schrodinger form: 


i h 


d'lftn (® j 0 
Ft 


H ( x , t ) / 0 n ( x , t ) 

|c& • — - a) + fimoc 2 + eAo(x, f)j 4> n (x, t) ,(12.45) 


where n characterizes the spinor’s quantum numbers. The time inversion of (12.45) 
is achieved by multiplying from the left by t : 


Tiht~ x 


dt 


Tip n (x,t) 


fH(x,t)f-'fip n (x,t) 


(12.46) 
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The notation Tiht 1 includes the possibility that the T operator may also contain 
complex conjugation. Since the time inversion as a special Lorentz transformation 
causes 


it follows, together with (12.44), that 

-tiht- l ^ , n (x,t , ) = fti(x, . (12.47) 

For t/>'(x, t) = fxp n (x,t) the same Schrodinger-like Dirac equation should hold as 
for ip n (x,t ) in (12.45); hence 

d 

ih—‘ip , n ( x , tl ) = H(x, t, )'il>' n (x,t') . (12.48) 

In case of time-inversion symmetry one obviously has to demand that H(x,t') = 
H ( x , —t) = H(x, t), and in this case an observer can, in principle, not distinguish 
between a forward and a backward running “movie”. 

Now the comparison of (12.47) with (12.48) yields a priori two possibilities of 
procedure: either we demand 

Tif- ] =i and TH(x,t)f- x = H(x,-t) = -H(x,t) (12.49) 

or 

tit~ x = -i and TH(x, t)T~ x = H(x , t') = H(x, - 1 ) = H(x, t) . (12.50) 

The latter possibility is the only possible one, because the operator equation 
TH(x, t)T~ l — H (x, t) can be included into the general scheme (3.27) of symme- 
try transformations. In addition, the condition (12.49) TH(x,t)T~ x = -H{x,t) 
would alter the spectrum of H by the time inversion in the special case of time- 
independent Hamiltonians, which can physically not be accepted. We also notice 
this fact if we look at the explicit form of H(x,t ): 

H(x,t) — ca • ihV — ^A(x, t)j + fimoc 2 + eAo(x, t) . (12.51) 

The conditions (12.49) cannot be satisfied: The vector potential A(x,t ) is created 
by electric currents j(x,t), which change sign under the transformation t — > — 1\ 
therefore it holds that 

TA(x, t)T~ x = A(x, -t) - - A(x, t) . (12.52) 

On the contrary the Coulomb potential is created by the electric charge density 
g(x, t), which remains unchanged under time inversion, and hence 

fA 0 (x,t)f- x =A 0 (x,-t) = A 0 (x,t) . (12.53) 

Furthermore we have 


A A 1 A A 1 

TVT~ X = V and TxT~ x 


= x 


(12.54) 
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because the application of T does not effect the space coordinates. Therefore we 
see that, for example, the choice (12.49) would yield 

faf~ x = a 

due to the term a • A(x, t). On the other hand the sign change of the term & • i V 
would imply 

tat~' = -a . 


Obviously both conditions are contradictory and, as a consequence, only (12.50) 
remains as a valid choice: It can be satisfied consistently, because with TiT~ l — — i 
it follows that 


TH(x, t)f~ x = fcaf~ x • \(f(-ih)t~ x ) fvf~ x - -TA(x,t)f 


-1 


“I - thqc^T 0T -f- e T Aq(x ) t)T 


= -TcaT~ 
= +H(x, t ) 


— i KV A(x, +t) 


+ m^c 2 t [3T * + eAo(x, — f) 
(12.55) 


[due to (12.50), (12.52) and (12.53)], if 
faT = -a , T0f~ l = +0 


(12.56) 


holds. Because / i / 1 = — i the operator T must contain the complex conjugation 
K, and therefore we set 


T = T 0 K , (12.57) 

where the matrix % must still be determined. If we insert (12.57) into (12.56), 
then 

= -a or = -i 

to0t o ~ x =0 or 7 o 7 ° 7’ 0 “ 1 = 7 ° . (12.58) 

Since only the matrix a 2 is purely imaginary, all other matrices being real [see 
(2.13)], these conditions explicitly read 

%&\Tq X - -&i , 

To a 2 f~ x = a 2 , 

With the help of the commutation relations (2.8) it can be shown that 

f 0 = — iaia 3 , f 0 -1 = ia 3 ai (12.60) 

satisfies these conditions, the factor i guaranteeing the unitarity of T. With 7 0 = 6 
and 7 ' = 6a, [see (3.8)] the complete time-inversion operator can then be written 
as 


fo&ito 1 


-a 3 


To0To~ X =0 


(12.59) 


T = —i&i&ik = vy x y’K = TqK 


(12.61) 
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In addition, we mention that the time-inverted Dirac equation (12.48) is form 
invariant, too. This was our intention. Indeed it was formulated in that way, and 
the comparison of (12.48) with (12.45) shows this immediately. 

Next we want to prove that the time inversion transformation T accords to 
the classical concept of time inversion. To that end we apply T to a free-particle 
solution (a plane Dirac wave) with positive energy. We know how a plane wave 
with four-momentum p M = {ep°,p} [p° is given by p° = +c(p 2 + m\c 2 ) 1/2 ] and 
spin s** = {s°, s} can be constructed (projected) from an arbitrary plane wave 
ip a (x), with the help of the projection operators, namely [see (12.33)] 

' (12 - 62) 


For particle with positive energy one has to set e = 1, and the T operator can act 
on this equation. With fipaixj) = r fj' a (x,t') = Toip^(x,t) we get 


Tlpeps 


f f zrf + m 0 c \ 

V 2m 0 c ) 

f f e//* + me 

0 \ 2 m 0 c 

j. ( q/* + mpe 

0 \ 2wo c 
{ €//' + m 0 c \ | 
\ 2 m 0 c J ' 


1 + Is?/* 




T’o-'T’o 
1 + 15 f 


1 + 15^* 


Ip'aiX’t') 


t-'^ a (x,t') 


(12.63) 


(because 75 is real). In the last step, equation (12.58) was used and the four-vectors 

p’ — and s' = {s 0 , -s} (12.64) 

were introduced. These projectors yield a free solution with opposite direction of 
the spatial momentum p and the spin 8 , if applied to the time-inverted plane wave 
This is the so-called Signer time inversion. It was introduced for the 
first time by Eugene Wigner in 1932. 5 

Now we want to see how the electron and the positron wave functions can 
easily be connected with the help of the time-inversion transformation. From our 
earlier considerations we know that the charge conjugate state ^ c , which is obtained 
from the state ^ by 

= Uif;* = C 7 V = C^Ki/j = CiJj (12.65) 

(C = i 7 2 7 °), describes a particle with the same mass mo and the same spin 
direction (polarization), but with opposite charge, opposite sign of energy and 
opposite momentum [see, for example, (12.38)]. If describes an electron 

with momentum p and negative energy, then 'i/’cC#? t ) describes a positron with 
momentum — p and positive energy, and both particles propagate forward in time. 

Now we combine the parity operation P [see (12.40)], the charge conjugation 
C [see (12.25)] and the time inversion T (12.61) and construct the wave function 


5 E.P. Wigner: Gottinger Nachrichten 31, 546 (1932). 
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V 7 pct( x') = PCt'ipix) = PC 7 0 0)* 

= Pcj° (i 'r l T 3 kiKx, - 1 '))* = pc 7 ° (rySVc*. -/'))* 

= -iPC^-y'^ipix, -/') = -iPi7 2 7°7°7 1 7 3 ^(®, -f') 

= e 1¥ ’7°7 2 7 1 7 3 ^(-® / , -f') = ie‘ ¥ ’i7 0 7 1 7 2 7 3 ^(-a; , ) -*') 

= ie lv 7 5 ip(-x') (12.66) 

(here 75 = was used.) If b(x, f) is an electron wave function of negative 

energy, then ip PC r is a positron wave function of positive energy, and this is effected 
by the charge conjugation C. Therefore, we can also read ( 12 . 66 ) as ip pcr(x,t ) 
being a positron wave function with positive energy moving forward in time and 
space (positive t, positive x). It is — up to a factor ie iv 75 — identical with an 
electron wave function of negative energy, moving backwards in time and space 
[negative t and negative x in the argument of ip(-x, -t) on the rhs in (12.66)]. 
For a plane wave with definite spin s fl and momentum p 11 one can deduce this 
result explicitly. The plane electron wave with negativee energy, momentum — p 
and spin — s is given by 

and moves backwards in space and time. If we apply the PCT transformation 
(12.66), we obtain 

*. CT (*,<) = ie^s ip^T 1 ) «- x ’ 

XX^XX^X'^ ) ■ 

This is evidently a positron wave function (because of the charge conjugation) with 
positive energy p 0 , positive momentum p and positive spin s , moving forward in 
space and time. 


EXERCISE 


12.6 Behaviour of the Current with Time Reversal and Charge Conjugation 

If the time inversion T is a given symmetry operation of the Dirac theory, then the 
rules for the interpretation of the wave function 

ip' a (x,t') = Tip a (x,t) , t' = -t (l) 

must remain the same for ip' a (x, t'). This means that observables consisting of bilin- 
ear forms of ip' a (x, t') and ip'+ (*> t') must be interpreted (i.e. physically explained) 
in the same way as those with ip a (x,t) and ip+(x,t). Naturally this is valid only 
up to the expected behaviour under time reversal of the special observable. The 
following examples will illustrate this. 

Problem, (a) Prove that the following relation is valid for the current: 


( 2 ) 
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Note: The indices on the lhs are lower indices (covariant indices) and on the rhs 
upper ones (contravariant indices). Similarly, show that 

(r)' = (r) , (p)' = -(p> • (3) 

(b) Show also the behaviour of these observables under charge conjugation C. 
Demonstrate especially that 

i> c ( x )l^c(x) = +ip(x)'y fl 'ip(x) 

and interpret this result. 


Solution, (a) The operation of time reversal transforms a spinor ip(t) into a spinor 
W) = T 0 r(t) f = -t (4) 


or 


4(t') = , 


(5) 


where the upper index “T” means transposition. In the usual representation of the 
7 matrices [see (12.60, 12.61)] one finds that 


A, -13 

To = 17 7 
Since 


y4 _ Q 


is valid, it may easily be seen that 

1 q = —17 '7 ' = —17 777 = —17 777 — — 1 v — 7 7 ) — 
where we have used the commutation relations of the 7 matrices and 

’ t ’ '1„,3„,1„,3 I 3 1 3 I 3 3 I n 

io • lo = >7 7 7 7 =-7777 = 7777 = U , 

i.e. To is unitary and Hermitian. From (6) it follows that 

7b7 1 7’o = -7 1 = -7 1 * , 

Wto = 7 2 = - 7 2 * , 

To^To = - 7 3 = — 7 3 * , 

m 0 'T T 0 I 

To 7 = 7 =7 


( 6 ) 


(7) 


( 8 ) 


(9) 


,0* 


( 10 ) 


because 7 1 , 7 3 , 7 0 are real and j 2 is purely imaginary, and we can simply write 
these four equations as 


T 7^7 = 7 * . 

Our task is to investigate for the current jj{x') the expression 

#(*') = M'WM') , 

so we obtain from (4), (5) and (11): 


( 11 ) 


Exercise 12.6. 


( 12 ) 
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= (V ,t (^)7o) 7 -T A ‘V'r(^) 

= ^(thoTo^Torit) 

= V’ T (07°7^*(0 = M) [ 7 0 7£] ai 8 V #(0 

= V #(0 [707^] M) = ^ T *(07j*7°^(0 
= V’ t (07j l 7V(0 = V’ t (O7 0 7/x7°7 0 ^(O 

= 0(O7mV’(O =7 m (0 (13) 

[in the last but one transformation we used (7)]. The time argument alone was noted 
in V>(a:, 0 . #e', ?')> because the position vector (x = x r ) remains unchanged under 
time inversion. The operators r and p commute with to, because they do not carry 
spinor indices (they are proportional to the unit matrix). For the position operator 
it holds, in particular, that 

= i> T (t)T 0 T 0 rr(t) 

= r^ T (0V’*(0 = r^ T *(0V»(0 

= ip f (t)ri)(t) ( 14 ) 

[^ T (tpp*(t) is purely real], wherefrom (r)' = ( r ) follows immediately by integra- 
tion. In the case of the momentum operator p = —ihV, we must explicitly write 
down the expectation value as an integral, in order to express clearly that p is 
Hermitian. Because of TopTo = p, one gets 

(PY = J V’r( a: > ? ')P^7'(£C,t , )d 3 ^ 

= J ip T (x,t)pip*(x,t)d ? 'x 
= J (— i fiVip\x)) , >p(x)d i x 

= - j i)\x)(-ihV)^{x)tfx - \h J tp\x)iP(x)dF 

surface of volume V 

= - J ipHx)pip(x) d 3 x + 0 = -(p) . (15) 

In the last but one line a partial integration has been performed, the surface term 
vanishing due to the general properties of wave functions at infinity. 

(b) For C = i 7 2 7 °, then, in analogy to (10) and (11): 

C 7 °C = 7 0 = 7 °* , 

C 7 2 C = 7 2 = - 7 2 * , 

C 7 ’C =- 7 ' =- 7 >* , 

C 7 3 C = - 7 3 = - 7 3 * , 

C'fd = y* , (16) 

which may be easily checked, so that 
A(x) = C~i°f*(x) , 

ipl(x) = i/) T (jc) 7 °C t , 


( 17 ) 
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and, further, 

C* = -C , C 2 = - 1 . (18) 

Now we can write 
i/'cT'Vc = 

= ip^y^C^y^y^Cy^ip* 

= ip J y°Cy 0 CCy»Cy°ip* 

= ^ T 7°7o7^7°^* = ^ T 7^7°^* 

= V>a (7^7°) aa , C' = 

= V’T^ = ip^ip , (19) 

which shows everything that is required. In the penultimate line, care is needed 
since we have to interchange the order of ip* and ip. This is possible if one regards 
ip as the wave function of an electron or positron; however, if ip is regarded as a 
quantized field operator ip, then an additional minus sign appears, due to the fact 
that ip and ip* are fermion operators, so that the charge conjugated current operator 
reads 

: 'Pcl/^'Pc := - : ipy^ip : , (20) 

where : . . . : implies normal ordering. 6 We now have to interpret the result (19) in 
the hole picture. If ip is the wave function of a state with positive energy, then ip c 
is a state of negative energy which is already occupied in the vacuum, so that the 
corresponding current must not be counted, since the current density of the vacuum 
is defined to be zero. On the other hand, if we create a positron in that state, then 
this state will be empty and the current ip^ipc is missing, so that we have to 
ascribe the current density —ip c j^"fic to the positron, which has the opposite sign 
as the corresponding charge density of the electron. From that we can conclude 
that electrons and positrons have opposite charges. 


The correctness of the interpretation given at the end of Exercise 12.6 can be 
explicitly verified for the Dirac equation involving electromagnetic interaction. To 
do this we write the Dirac eigenvalue equation for states with negative energy in 
the presence of the electromagnetic four-potential A fl — (Aq, A) as 


a • (- 


-i KV A ) + finiQC 2 + eAofx) 


ip(x) = —Eip(x) 


( 12 . 68 ) 


and operate with the PCT transformation (12.66). Since under space-time inversion 
= — x^, see (12.52) and (12.40)] 

PTA^x) (. Pty 1 = A'^x') = +A^(x) (12.69) 


Exercise 12.6. 


6 See W. Greiner, J. Reinhardt: Quantum Electrodynamics , 2nd ed. (Springer, Berlin, Hei- 
delberg 1994). 
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is valid, then (12.68) transforms under PCt into 


PCT 


a • (-i HV - £ -A{x, + (3moc 2 + eA 0 (x, t) (PCT)-\PCf)tp(x, t) 


= -E(PCf)tp(x, t) 


(12.70) 


This results in 

ippcrix') = ippcr(-x, — 0 
= PCT'ip(x,t) , 


PctaiiPCty 1 - Pfctait- l fc- l p- x 
= -Pc&ifc-'p- 1 
= -P&iP- 1 

= &i , 

PctV(Pct ) _1 = -V = V' , 
PCtj3(PCf)-' = -/ 3 , 


and therefore (12.70) becomes 

COL ■ (-iHV' + e -A'{x')} 


+ j3moc 2 - eA' 0 (x') \ ippcrix') 


= Ei>pcr{x') , 


(12.71) 

(12.72) 

(12.73) 


(12.74) 


with x' = —x. 

If (12.68) was the Dirac equation for particles with charge e, rest mass mo 
and negative energy (— E) moving forward in space and time (x,t), then (12.74) 
is the Dirac equation for particles with charge —e, rest mass mo and positive 
energy (+E) moving backwards in space and time [in the argument of li'pci (x'): 
x' — -x = -{ ct , -x}}. We can therefore interpret positrons as electrons of 
negative energy moving backwards in space and time. This important result serves 
as one of the fundamental concepts of positron theory, 7 which was founded by 
Stuckelberg and Feynman. In quantum-electrodynamical perturbation theory (which 
is based on it) we shall extensively both make use of, and recognize the great 
advantages of, this formulation. 8 

Remark. The form of the interaction between the electron-positron field and the 
electromagnetic field has been assumed to be 

j»{x)A»(x) = e -^^{x)A^(x) . (12.75) 

This followed as the simplest interaction which is gauge invariant, and it is equiv- 
alent to the interaction between electrons and the electromagnetic field which we 
know from the nonrelativistic limiting case. This interaction is C, P and t in- 
variant (see Exercise 12.6). Whether or not these symmetries are realized depends 

7 E.C.G. Stuckelberg: Helv. Phys. Acta 14.32L, 588 (1941); R.P. Feyman: Phys. Rev. 76, 
749 (1949); ibid. 769. 

8 See W. Greiner, J. Reinhardt: Quantum Electrodynamics, 2nd ed. (Springer, Berlin, Hei- 
delberg 1994). 
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on the actual interaction. As we know, an additional interaction with the magnetic 
moment of the form 

(12.76) 

for spin-^ particles with anomalous magnetic moment ( g factor ^ 2 as for example 
for protons and neutrons), is in general necessary. This additional interaction also 
shows all the symmetries mentioned above, as can be easily verified. Extending 
the Dirac theory to other spin-^ particles, for example ji mesons, and to other 
kinds of interaction (e.g. the weak interaction), the assumption that the C, P, and 
t symmetries are also valid suggests itself. 

This is an assumption which has to be verified by experiment, i.e. success or 
failure of phenomena predicted by it. Lee and Yang showed 9 that the symmetry 
of parity is no longer valid in the weak interaction; however, the much weaker 
assumption of Lorentz invariance and the connection between spin and statistics 
(spin-^ particles obey Fermi statistics, spin-0 particles obey Bose statistics) 10 al- 
ways guarantees the invariance of the interaction under the product PCT. This is 
the famous PCT theorem, which was derived by R. Liiders. 11 


12.5 Biographical Notes 


WIGNER, Eugene Paul, Hungarian- American physicist, * 17.11.1902 in Budapest, pro- 
fessor at Princeton University and afterwards at Louisiana State University in Baton 
Rouge/Louisiana. He studied at the Universities of Berlin and received his Ph.D. in electrial 
engineering from the Technical University in Berlin in 1925. He made important contri- 
butions to theoretical physics, above all by introducing systematically group theoretical 
methods into physics. His book “Group Theory and its Application to the Quantum Me- 
chanics of the Atomic Spectra” became a classic. But also his contributions to the theory 
of nuclear forces, neutron absorption and quantum mechanics, and parity conservation are 
widely known. W. decisively participated in the development of the American atom bomb 
and the construction of the first nuclear reactor. In 1963 W. received the Nobel Prize in 
physics, together with Maria Goeppert-Mayer and J.H.D. Jensen. In 1958 he received the 
Enrico Fermi prize and in 1961 the Max Planck Medaille [BR]. 


9 T.D. Lee, C.N. Yang: Phys. Rev. 105, 167 (1957). 

This is extensively discussed in W. Greiner, B. Muller: Gauge Theory of Weak Interactions , 
2nd ed. (Springer, Berlin, Heidelberg 1996). 

10 This is discussed in W. Greiner, J. Reinhardt: Quantum Electrodynamics 2nd ed. (Springer, 
Berlin, Heidelberg 1994). 

11 R. Liiders: Kgl. Dansk. Vid. Sels. Mat.-Fys. Medd 28, no. 5 (1954). 




13. Klein’s Paradox 


In the following, we want to concern ourselves with the scattering of an electron 
with energy E and momentum p = p z at an infinitely extended potential step 
(Fig. 13.1). First we shall study this problem from the point of view of the one- 
particle interpretation of the Dirac equation and then, in Example 13.1, we shall 
look at the same problem using the framework of hole theory, understanding better 
the resulting situation, which looks paradoxical at first sight. 1 

For the free electron we have ( E/c ) 2 — p 2 + m^c 2 , whereas in the presence of 
the constant potential, 




2 

_ —2 - 22 
— p + THqC 


(13.1) 



'V(z) 


i 


z 


Fig. 13.1. An electron wave 
propagates along the z axis 
and hits a potential step of 
strength Vq 


is valid, where p denotes the momentum of the electron inside the potential. The 
Dirac equation and its adjoint then read 


{ 


E - eV 
c 



(13.2a) 


^{ E c:^ ~ P m ° c \ +ih ^ W~ &k = 0 ' 

^ ' t=i OXk 

We now assume that 

eV — Vo for z > 0 , 
eV = 0 for z < 0 , 

and that the incoming wave is given by 


ipi = «i exp ^ -(pz - Et ) 


so that, inserting (13.3) into (13.2a) and using a = <23 it follows that 
E 


f E ) 

< dtp — (3moC > m, = 0 . 


(13.2b) 


(13.3) 


(13.4) 


O. Klein: Z. Phys. 53, 157 (1929). 


1 
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Since we require it, ^ 0, then because of afi fia = 0 we conclude that 


E 2 


p 2 + mlc 2 


(13.5) 


and, moreover, due to our interest in the incoming electrons we choose E > 0. 
The momentum of the reflected wave must be —p, whereas the momentum p of 
the transmitted wave is given by (13.1). For small Vo, P is positive, so that in the 
first instance we can set 


' r = H r expj^(-pz -Ef)j , ipt = M,exp|^(pz -£oj 


Ipr = Mr exp { -(- pz ~ Et) 
and therefore, due to (13.2a), 

+ ap — fimoc 1 M r = 0 and 


{§ 


{ 


E- Vo 


— ap - fimoc 


| «t = 


(13.6) 


0 . (13.7) 


The total wave function must be continuous at the boundary, i.e. for z — 0 


Mj + M r = M t 

must be valid. From (13.4) and (13.8) therefore follows 


E . \ 

— - (3moc 1 (m ( + Ur) = +ap(Ui - M r ) , 


and with (13.7) and (13.8) we get 


^ - j3moc \ (m; + Ur) = 0^- + ap ) (m ; + M r ) . 


Thus we have 
Vo 


+ ap (Mi + Mr) = +ap(Ui - Ur) 


or 


jy +a(p +p)^u T = - -a(p -p)^Ui . 

We multiply both sides by Vo jc - a(p +p), which, because of a 2 
(13.1) and (13.5), leads to 

(2Vo/c)(—E/c + ap) 

U x — ^ — «i = fUi . 

V 2 /c 2 -(p+p)i 

Analogously we find for the adjoint amplitude 
uj = ru- , 


(13.8) 

(13.9) 

(13.10) 

(13.11) 

(13.12) 
1 and with 

(13.13) 

(13.14) 


i.e. 


U^ U T 


2V 0 /c 


Vq/c 2 -ip+p) 2 


U 


E . - 

hap 

c 


so that using the identity 


(13.15) 
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t - p c1 t 

cu au, = — u u, 

' E 1 


(13.16) 


(which can be easily derived from the equations of motion for uj and u,) it follows 
from (13.15) that 


n/w r = 


2V 0 /c 


v 0 2 /c 2 -(p + py 


E 2 2\ t 

F +p)“S c 


2Ep u 
Mj' au\ 


2 V 0 m 0 


Vq /c 2 ~(p +p) 2 


m/wi = Ru^Ui 


(13.17) 


Thus the quantity R is the fraction of electrons which are reflected: For Vo = 0, 
R = 0, whereas for V 0 — E — m 0 c 2 [i.e. p — 0, from (13.1)], R = 1, and 
all electrons are reflected. If Vo increases still further (Vo > E — moc 2 ), then p 
becomes imaginary. Then we set 


V>t = Ht exp j-/xz - i j , 


(13.18) 


where p is now a real quantity, p must be greater than zero, since otherwise the 
density on the rhs of the barrier would be infinitely large for z — > oo. On the other 
hand, because of (13.6), p = +i hp, i.e. for (13.13) we get 


(2 V 0 /c)(E/c - ap) 

U v = — ^ ; ~U\ 


it = 


V 2 /c 2 — (p + i hp) 2 
and therefore 

(2 V 0 /c) 2 (E 2 /c 2 - p 2 ) 


(2V 0 /c)(E/c - ap) t 
V 0 2 /c 2 ~(p - i hp) 2 U ' 


n/« r — 


[(Vo/c + p) 2 + p 2 h 2 ] [{Vq/c - p) 2 + p 2 h 2 ] ' 
Using (13.1) and (13.5) we conclude that 
2 V 0 (2E - Vo) 


U; U\ 


p =p 


i.e. since p 2 = —p 2 h 2 , 


— ±p) +p 2 h 2 = 2— (~±p 

c } c \ c 


(13.19) 


(13.20) 


(13.21) 


(13.22) 


and therefore u}u s = u- «, . 

This means that the reflected current is equal to the incoming one. Behind 
the boundary there is an exponentially decreasing solution for the wave. Due to 
(13.20) the condition for that case is p 2 < Vo(2E — Vo )/c 2 , and for increasing 
Vq this condition is fulfilled as soon as Vo exceeds the value E — c(E 2 /c 2 — 
p 2 ) 1 / 2 = E — moc 2 . If Vo increases further, then, due to (13.21), p first increases, 
reaches its maximal value for E = Vo and then again decreases, becoming zero 
for Vo = E + moc 2 . For still greater values Vo > E + nioc 2 , p again assumes real 
values, so that (13.13) and (13.17) are again solutions of the problem. However, 
in this region the kinetic energy E — V o is negative, so that this is a classically 
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forbidden situation. The group velocity, which due to (13.16) and (13.1) is given 
by 


u gr — 


E - Vo 


(13.23) 


therefore has the opposite direction as the momentum p in this region if we choose 
p > 0. Since the group velocity is the velocity of the moving wave packet, it 
looks as if the transmitted wave packet came in from z = +oo; on the other hand 
this contradicts the condition which only allows an incoming wave packet from 
z = -oo. We thus have to choose p < 0 [i.e. the negative sign of the root in 
(13.1)]. However, this condition is not included in the Dirac equation, but is forced 
upon us by the physical boundary conditions. 2 In the discussion given by Bjorken 
and Drell 3 this has not been taken into account, the reflection coefficient reading 


(1 ~ r) 2 
(1 + r) 2 


with 


(13.24) 


p E 4- moc 2 
p E - V 0 + moc 2 


(13.25) 


For Vo > E + moc 2 the fraction indeed becomes negative, but r always remains 
greater than zero, because we have to choose p < 0 due to the boundary conditions. 
Hence for the reflection coefficient we always have R < 1 and not, as given by 
Bjorken and Drell, R > 1. 

We thus have seen that for Vq > E + moc 2 a fraction of the electrons can 
traverse the potential barrier by transforming the original positive value of the 
kinetic energy to a negative one. The group velocity of the tunneling electrons is, 
due to (13.1), given by 


V 0 -E 


= c 


1 - 


mpc 2 \ 2 
Vo-Ej 


(13.26) 


which for V 0 = E + nioc 2 is just zero and for V 0 — > oo approaches the velocity of 
light. When Vo = E + moc 2 , the reflection coefficient from (13.17) is just R = 1 
(total reflection); it decreases for increasing Vo down to the value 

( E /c — p) 

= Rrin = lim R(V o) - ) J (13.27) 

Vo— >00 ( E/c+p ) 

for Vo — > oo. The corresponding fraction of electrons travelling through the bound- 
ary surface is thus 


P = 


2p 

E/c + p 


(13.28) 


where (3 is called the transmission coefficient and a -(- 3 = 1 ! For p = moc (i.e. 
electrons with a velocity 80% that of light) we get with (13.5): (3 « 2/(2 1 / 2 + 


2 H.G. Dosch, J.H.D. Jensen, V.L. Mueller: Phys. Norv. 5, 151 (1971). 

3 J.D. Bjorken, S.D. Drell: Relativistic Quantum Mechanics , ed. by L. Schiff, International 
Series in Pure and Applied Physics (McGraw-Hill, New York 1 964). 
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1) ~ 0.83, i.e. 83% of the incoming electrons penetrate the potential barrier. 
This large transmission coefficient also remains for Vo not approaching infinity but 
only several rest masses. Calculations by F. Sauter 4 using a smoothened potential 
edge have shown that this large transmission coefficient, which is classically not 
understandable at all, does not occur if the width of the increase from V = 0 to 
V — Vo is of the order of the Compton wavelength, i.e. 

d < — ■ (13.29) 

rriQC 

This unexpected largeness of the transmission coefficient is known as Klein’s para- 
dox , and its interpretation given here is completely that of a single particle. In the 
framework of this representation it is not necessary at all to consider pair produc- 
tion (as has already been done by Bjorken and Drell at this level), though we shall 
see in connection with the hole theory that the boundary conditions change (no 
wave coming in from z = +oo), that is we do not have to demand p < 0 any 
longer (see the following Example 13.1). 


EXAMPLE 


13.1 Klein’s Paradox and the Hole Theory 

The hole theory becomes important if one wants to describe the behaviour of a 
particle wave striking a potential barrier with Vo > m^c 1 4 - E. 

The Dirac equation for a plane wave moving in the z direction with spin up is: 

(a) for region I (see the following figure) 



(■ caj,p z + flmoc 2 ) ip — E ip ; 

(1) 

(b) for region II 


(' caipz + Pm 0 c 2 ) ip = (E -V 0 )ip , 

(2) 


with solutions 


Electron wave and potential 
barrier 


F. Sauter: Z. Physik 73, 547 (1931). 


4 
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Example 13.1. 


= a 


i i ^ 

P\c 

E + m 0 c 2 

Vo/ 


Jpiz/h 


p\c = ^E 2 -ml 

( 

ipu = B 


V 


1 

0 

-p 2 c 


Vo - E — m Q c 2 
0 


Jpu/h 


P2C = \J (v 0 - Ef - m y . 


( 3 ) 


(4) 


Decisive for the explanation is the fact that for V 0 > E + m 0 c 2 the momentum p 2 
becomes real again, allowing for free plane waves to propagate in region II. This can 
only be understood by the existence of a second energy continuum corresponding 
to the solutions of the Dirac equation with negative energy (see below). From 
the impacting wave (3) one part is reflected (maintaining energy and momentum 
conservation): 


rt = c 


( 1 

-P\c 


\ 


Q -'\p\z/h 


^ E + m 0 c 2 J 


(5) 


and the other part propagates further (4). We must require at z = 0 that the wave 
functions be equal inside and outside the potential (this does not mean continuity!): 


V>I (z = 0 ) + ipliz = 0 ) = Tpuiz = 0 ) 


( 6 ) 


From this the equations determining the various coefficients follow: 


A + C — B and 

p 2 E + m 0 c 2 


A-C = -B 


PiVq-E- m 0 c 2 


= -B< 


(V 0 - E + niQC 2 ) {E + m 0 c 2 ) 

(V 0 - E - m 0 c 2 ) ( E - m 0 c 2 ) ^ 


Thus we have 


(7) + ( 8 ) =» A = f(l- 7 ) | C 1+7 


(7) - ( 8 ) 


C = f (1 + 7) 


1 -7 


and 


B _ 2 

A 1—7 


(7) 


( 8 ) 


(9) 


( 10 ) 
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With the expression for the particle current 
j(x) = ci>* (x)ai()(x) 
and 

(o, , P ' C 0, \) 


Ip} &2=A*[ 0, -i 

=A* 


E + moc 2 5 

P ' c , 0 , c~ ipiz/h , 


E + mot -2 

o 1 o') g-iPizA 

E+m 0 c 2 ’ ’ ’7 


it follows that 


2pic 2 

7 , = AA z- e, 

,/I E+ m 0 c 2 z 

Correspondingly 




E + m 0 c 2 z ’ 
lp 2 c 2 


V 0 -{E+ moc 2 ) 1 ’ 

Thus the ratios of the currents [7 in (9) is real] are 
liil _ (1 +7) 2 


IjiI (1-7) 2 

I JhI 4 


-7I = 


47 


|JiI (1-7) 2 ’ M (1 - 7) 2 
It can be seen from (8) and 7 > 1, leading to 


( 11 ) 

( 12 ) 

(13) 

(14) 

(15) 

(16) 
(17) 


(18) 


UII > |3,I ■ 09) 

This result corresponds to the fact that the flow of j n is in the (— z) direction, i.e. 
electrons leave region II, but according to our assumptions up to now, there are no 
electrons in there anyway. A reinterpretation is thus necessary, and in doing this 
the solutions of negative energy formally derived previously are treated seriously; 
thus there exist two electron continua: 

To prevent the transition of all electrons to states of negative energy one has 
to require that all electron states with E < —moc 2 are occupied with electrons 
(see Chap. 12). This hypothesis permits the following explanation: The potential 
Vo > moc 2 + E raises the electron energy in region II sufficiently for there to be 
an overlap between the negative continuum for z > 0 and the positive continuum 
for z < 0. In the case of Vo > moc 2 + E the electrons striking the potential barrier 
from the left are able to knock additional electrons out of the vacuum on the right, 
leading to positron current flowing from left to right in the potential region. 

From this notion it is understandable that there occur free plane wave solutions 
in region II given by (4) (see following figure), called positron waves. Furthermore, 


Example 13.1. 
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Energy levels of the free 
Dirac equation 



according to the hole theory, the negative continuum states are occupied, so that it 
is now possible to understand the sign of j n in (17) by assuming that the electrons 
entering region I are coming from the negative continuum: Correspondingly the 
relation 

Ji + fi = 3\ (l - = Ji (] ll f) 2 = in (20) 

holds. Since the holes remaining in region II are interpreted as positrons, it is 
possible to describe this effect in an alternative manner: The phenomena described 
above can be understood as electron-positron pair creation at the potential barrier 
(as shown in the figure below) and is related to the decay of the vacuum in the 
presence of supercritical fields. 5 


Energy continua of the Dirac 
equation at a potential barrier 



5 This is discussed by J. Reinhardt, W. Greiner: Rep. Prog. Phys. 40, 219 (1977); and is 
covered in more detail in W. Greiner, W. Muller, J. Rafelski: Quantum Electrodynamics 
of Strong Fields (Springer, Berlin, Heidelberg 1985). 
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In 1930 W. Pauli postulated the existence of the neutrino in order to guarantee the 
energy and momentum conservation for the weak interaction, which at that time 
seemed to be violated in the /3-decay experiments. Since the energy of the neutrino 
could not be determined even in the most sensitive measurements of the (3 decay 
of nuclei, the interaction of this postulated particle with matter must be extremely 
small. For example, it must not have electric charge, and accordingly, mass and 
magnetic moment must be assumed to be nearly vanishing, or even zero. The 
particle was named “neutrino” and abbreviated by ‘V. Because of the relativistic 
mass-energy relation, a particle with rest mass m v — 0 moves with the velocity of 
light. The experimental upper bound for the rest mass of the electronic neutrino is 
about a few electron volts and thus less than a thousandth of the electron’s rest mass. 
Therefore the assumption m v = 0 seems to be reasonable. Further experimental 
observations of the angular momentum balance during (3 decay showed that the 
neutrino has spin Consequently the Dirac equation for nio = 0 should be the 
fundamental equation of motion for the neutrino. 

Neutrinos and photons can be regarded as equal as far as charge, magnetic 
moment, mass and velocity are concerned, the main difference between the particles 
being, respectively, the half-integer and integer half-integer spin. There is also a 
difference between the neutrino v that occurs in (3 + decays and the antineutrino V 
that is emitted during the (3~ decay. Historically the existence of the neutrino can 
be experimentally proved as an outcome of the (3 decay (/ 3~ decay) through the 
recoil of the atomic nucleus in the reaction 


n — * p + e + v 


Further, the inverse /3 decay (/3 + decay) can be initiated by the neutrino and 
observed in experiments: 


p + v — * n + e + . 

Another empirical property which has been found by accurate studies of the in- 
verse (3 decays is that the antineutrino always has the same distinct spin orientation 
compared to its momentum direction. Assuming in the calculations that the spin of 
the antineutrino can be oriented parallel as well as antiparallel to its momentum 
results in theoretical cross-sections half as large as the experimental values. Precise 
experimental analyses have shown that the spin of the neutrino is antiparallel and 
the spin of the antineutrino is parallel to its momentum direction (see Fig. 14.1). 
This is the basic phenomenon of parity violation: If parity was conserved, neutri- 
nos as well as antineutrinos must exist in nature with both spin directions. Finally, 



P_ 

V 





Fig. 14.1. The spin of the 
neutrino is always directed 
antiparallel to its direction of 
motion; the opposite is true 
for the antineutrino 
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in this short introduction we mention that there are different kinds of neutrinos, 
which can be related to electrons, muons and tauons by their specific behaviour in 
weak-interaction processes. 1 

In 1929 Hermann Weyl proposed a two-component equation to describe mass- 
less spin- 4 particles, 2 but the Weyl equation violates parity invariance and therefore 
it was at first rejected. Then, after parity violation of the weak interaction was ex- 
perimentally proved in 1957, Landau, Salam as well as Lee and Yang took this 
proposal 3 and regarded the Weyl equation as the basic equation of motion of the 
neutrino. 

We first consider the Dirac equation of a particle with mass mo = 0: 
d# 

i h— = cat • p&(x) , (14.1) 

and this equation no longer contains the ft matrix. The anticommutation relations 
for the three matrices di, &2 and a 3 , 


{rt/ , otj } — 2<5 y , 


(14.2) 


can be satisfied by the 2 x 2 Pauli matrices a, . Merely the necessity of constructing 
/3 as the fourth anticommuting matrix requires the introduction of 4 x 4 matrices, 
and the necessity of describing particles with spin- ^ by two spinors is obviously 
connected with the particles’ masses. Therefore this reason disappears if the mass 
is zero: The wave equation of such a particle can be set up with only one spinor. 

The wave equation for the two-component amplitude \P^(x) describing the 
neutrino reads 


d<£ (+) . , 

i h — - — = c& -p& + \x) 
at 

or, after dividing by i h, 
0 $<+> 


dt 


= —c& • V$ (+) (jc) , 


(14.3) 


(14.4) 


where <7, are the 2 x 2 Pauli matrices. The plane-wave solutions of the Weyl 
equation are given by 


& +) (x) 


1 

v/2£(27t) 3 


e" 1 px / h u w (p) 


with 


P = {Po,p} = 



p • x = poxo - p-x 


x = {xo, x} and 


(14.5) 


(14.6) 


1 The theory of weak interaction is extensively discussed in W. Greiner, B. Muller: Gauge 
Theory of Weak Interactions 2nd ed. (Springer, Berlin, Heidelberg 1996). 

2 H. Weyl: Z. Physik 56, 330 (1929). 

3 L. Landau: Nucl. Phys. 3, 127 (1957); T.D. Lee, C.N. Yang: Phys. Rev. 105 1671 (1957); 
A. Salam: Nuovo Cimento 5, 299 (1957). 
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Following the plane-wave solutions for electrons, the neutrino wave functions are 
normalized in such a way that the norm remains invariant under Lorentz trans- 
formation (see Chap. 6). w (+) (p) is a two-component spinor which satisfies the 
equation 

/?ow (+) = & *pu^ . (14.7) 


This relation is of special interest, since we note that due to (14.7) the solution for 
a given sign of energy po corresponds to a certain orientation of the spin <x with 
respect to the direction of motion p. By applying the helicity operator & • p/\p\ to 
both sides of the equation and using the relation 

(<r . A)(& . B) = A • B + i& . (A x B) , 
we get ((X • p) 2 , and therefore 

(po - P 2 ) « (+) = 0 , (14.8) 

so that only non-vanishing solutions for « exist if 


Po = ±|pI (14-9) 

is valid. This is naturally the relativistic energy of massless particles, and here it 
again follows that neutrinos move with the velocity of light. But, of course, this 
result was already implied in our initial demand that the massless particle has non- 
zero energy. In the common representation of the Pauli matrices, with the z axis 
in the p direction, the solution of (14.7) reads 


u 


(+) 


1 

0 


(14.10) 


This solution describes right-handed massless particles with spin in the direction 
of motion. By this we mean 


r 'P (+) _ « ( 1 

\p\ “ Mo 



(14.11) 


or, literally: The helicity operator has a positive eigenvalue, the spin is directed 
parallel to p, which corresponds to a right-handed screw if we look in the direction 
of motion (i.e. in the direction of p/|p|), and which is illustrated in Fig. 14.2. 

For states with positive energy (po = +|p|) the wave equation (14.3) only 
has waves of positive helicity as solutions. With (14.7) and the aid of (14.9) we 
see immediately that we have the reverse result for states with negative energy 
(po = — |p|)*. there, the wave equation (14.3) only contains waves of negative 
helicity as solutions. This remains valid even in the hole theory, where a wave 
function with negative energy, negative momentum, and negative spin direction is 
interpreted as an antiparticle with positive energy, positive momentum, and positive 
spin direction. This assignment of ±po <=> particles, antiparticles to the helicity is 
contrary to the requirement of the experiments dealing with weak interactions (see 
Fig. 14.1). 

To describe left-handed massless particles, we must obviously start with the 
equation 



Fig. 14.2. The spin of the 
massless particle described 
by the wave equations (14.3) 
or (14.7) points in the di- 
rection of p , corresponding 
to a right-handed screw. By 
convention the helicity of the 
particle a • p/\p\ = +1. 
A neutrino cannot be such a 
particle, because it is known 
from experiments that the 
neutrino has helicity — 1 
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ih-^- = -cfr-p<P ( \x) . (14.12) 

Replacing & by —& also yields a realization of the commutation relation (14.2), 
so that (14.12) is then a possible Dirac equation for massless spin- 2 particles, in 
the same way as (14.3). With the ansatz 


^~\x) 

we find 


1 

^2£(2tt) 3 


e -i P'*/V->(p) 


(14.13) 


(-) - (-) 
PoU K } — -CT -PU K ' 


with 


« ( -> = 


(14.14) 

(14.15) 


In (14.14), pq — E/c can again be either positive or negative. We call solution 
(14.13) a neutrino state, because, as already mentioned repeatedly, the experiments 
show that neutrinos appear only as left-handed particles (with negative helicity). 
In this state the spin is antiparallel to p and can be representend by a left-handed 
screw (see Fig. 14.1). For the antineutrino state (po < 0) we have the opposite 
case, the particle is right-handed. Results achieved in /3-decay experiments show 
that the neutrino always moves antiparallel to its spin direction. This means that the 
helicity or longitudinal polarization of a neutrino with positive energy is negative, 
while the helicity of a neutrino with negative energy is positive. The solution with 
Po = — |p| yields a spin parallel to p and can be represented by a right-handed 
screw. 

On the other hand the helicity of a particle with positive energy, described by 
(14.4), is also positive. Therefore, this particle of (14.4) may be identified with the 
antineutrino of (14.12). As already mentioned, according to the interpretation of 
the negative energy states within the hole theory, the antineutrino has a momentum 
which is opposite to the empty negative energy state, and also the spin flips. 
Therefore, the relation between spin direction and momentum of the antineutrino is 
represented by a right-handed screw according to (14.14). The massless antiparticle 
which belongs to (14.3) or (14.7), respectively, has negative helicity; it obviously 
behaves exactly like the neutrino described by (14.14). Nevertheless, it should be 
mentioned that stating that the neutrino (antineutrino) is always left handed (right 
handed), only makes sense if the rest-mass is exactly zero. Otherwise a Lorentz 
transformation, which transforms a left-handed particle into a right-handed one, 
can always be found. 

To derive the neutrino current we rewrite (14.12) as 


1 

c dt 


— <x 


v$< -) = 0 


(14.16) 


Combining the unit matrix and & to form a four- vector = {11, +<r}, we can 

write the Weyl equation in a more compact way: 

<V V M <2> (_) = 0 , (14.17) 


and the Hermitian conjugate equation reads 
V' i (<p ( - ) ) t ^ = 0 . 


(14.18) 
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We multiply the first equation from the left by (3> ( ) ) t , the latter one from the 
right by and then add both equations. As usual this yields 


V /i (<P ( - ) ) t ^ /i # ( - ) = 0 , (14.19) 

which is a continuity equation with the four-current 

• (14.20) 

Its space and time components are 

j = _ ) (14.21) 

q= (^ (-) ) t <? (_) . (14.22) 

The normalization constant of the neutrino wave function can be derived from the 
integral with positive definite density g , if we require that 

f gtfx = 1 . (14.23) 


By considering various types of interactions, neutrinos can appear together with 
other spin-^ particles which have a finite mass and therefore are described by 
four-component wave functions. To have a unified description in such cases, it is 
appropriate to also introduce a bispinor wave function for the neutrino. To provide 
the connection between the two-component solutions of the Weyl equation with the 
already known four-component electron spinors, we go back to the Dirac equation 
for a particle with rest mass mo. However we choose a different representation of 
the Dirac matrices a and f3 , namely 


a. 


&i 0 

0 —G[ 


(14.24) 


$ = 


0 -1 
-B 0 


(14.25) 


One proves immediately the anticommutator relations of the Dirac matrices: 


ati&j + ajoti 


°U (»J 0 

0 )\Q 0 -aj 

didj+ajdi 0 

0 d,dj + Oj&i J ij ’ 


a, 0 \ 

0 -at ) 

(14.26) 


&i0 + P&i 


&, 0 W 0 — 1\ / 0-11 

0 o ) + \ -l 0 

" -o') + U o')-» • 


&i 0 
0 ~&i 


(14.27) 


This representation has the disadvantage that the four components of the bispinor do 
not split up into small and large components in the non-relativistic limit. However, 
as neutrinos have at least approximately zero mass and are therefore relativistic 
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particles, this disadvantage is in practice insignificant. In this representation, with 
the help of the two-component notation 


<£ = 


'<£>(+) 

<p(-) 


(14.28) 


the Dirac equation can be split up into 

Q<p ( +) 

i h — - — = — i Kco • — moc 2 &-^ 

at 

= co — moc 2 tP (_) , (14.29) 

/)( fi(-) 

i h — - — = i hc& • — moc 2 ^ +) 

at 

= —co • — moc 2 ^ (+) . (14.30) 

Note that the upper and the lower components of <P are coupled by the mass term. 
In the limiting case mo — > 0 two decoupled two-component equations are obtained, 
corresponding to the respective Weyl equations. Then <£ (+) describes right-handed 
and left-handed massless particles. Because of the fact that in nature neutrinos 
or antineutrinos appear only with a definite helicity, we must require that in the 
four-component description two of the components vanish. This is achieved by 
applying the projection operators 

P± = ±(11 ± 75 ) (14.31) 


to the spinor P of (14.28). In the representation used here (14.24) and (14.25) 75 
is diagonal, and with 7 s = i 7 ° 7 ' 7 2 7 3 = 75 it follows that 


Thus we get 

!A +) =-(ll±75)<F= f Q J • 

Also, by application of 7 2 = 11, one evidently gets 


(14.32) 

(14.33) 

(14.34) 

(14.35) 

(14.36) 


Equations (14.33) and (14.34) are in agreement with the Dirac equation only in 
the case that the particle mass is exactly zero, i.e. only then are and iP* - ) 
solutions of the Dirac equation. Indeed 7 s anticommutes with all 7 matrices and it 
obviously commutes with the mass term. Hence, for vanishing rest mass mo, ip { ~ ] 
and P { ~ ) are eigenfunctions of the Hamiltonian, the helicity operator and of 75 . 

The two-component Weyl theory is thus equivalent to a four-component Dirac 
representation. However, in the framework of the Weyl equation, the distinction 
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between particles and antiparticles is superfluous. The two possible states of the 
neutrino are only characterized by parallel and antiparallel orientation of the spin 
with respect to its momentum. 

Now, we want to analyse the non-invariance of the two-component theory 
under parity transformations. Let us have a look at the two-component equations 
(14.3) and (14.7). By performing a space inversion (p — > — p, x — *• —x, a — > a) 
a state with the energy po = |p|, momentum p and helicity a • p/|p| = 1 will 
be transformed into a state with po = |p|, momentum — p and helicity —1. Such 
a state, though, does not exist in this two-component theory of massless spin- j 
particles. 

There is another way to recognize the non-invariance of a two-component 
theory for space inversion, p is a polar vector, a is an axial vector and thus a • p 
is a pseudoscalar under inversion. This is also obvious from (14.35) and (14.36) 
which give rise to 

tp(±) 75 #(±) = ±ip(±)^(±) . (14.37) 

The left-hand side is a pseudoscalar density while the right-hand side is a scalar 
density. Thus helicity eigenstates are no parity eigenstates. The wave equations 
(14.29) and (14.30), which contain the mass mo, are symmetric with respect to 
reflection. In the description of a particle by only one spinor this symmetry gets 
lost. This symmetry, though, is not essential, because the reflection symmetry need 
not be a universal property of nature. Reflection symmetry only exists if the particle 
is replaced by the antiparticle simultaneously. Equations (14.29) and (14.30) are 
in fact invariant under x — > — x and The situation might be further 

clarified by observing that the mass term in the Dirac equation mixes the two 
helicity states while the kinetic terms conserve it 

& (ff — eJfi — moc ) 

= ^70 — ~ e 4 ) ~ 11 ^ 75 m oc ^ (±) 

= ( 11 \ 75 ) ^ 100 ~ e4)'P {±) - 7o«Jo^ (±) 

= 0 - e^ (±) - ¥ T) moc 2 & (±) . (14.38) 

Left- and right-handed fermions are thus strongly coupled if they have non- 
vanishing mass. 

Finally we consider the angular-momentum representation of the Weyl equation. 
With 

$* +) = <£ (+) e~ iEt/h , (14.39) 

then from (14.4) we get 

ha • V0 (+) = ip 0 <£ (+) • (14.40) 

The operators J 2 and J z commute with a • V, while L 2 does not. Accordingly, the 
angular momentum representation of the neutrino and antineutrino wave function, 
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respectively, contain spherical spinors x±w As we have already learned from the 
discussion of electron wave function including the coupling to external fields (see 
Chap. 9), the spin-orbit operator a • L also does not commute with & • V. The same 
holds for the parity operator multiplied by any arbitrary 2x2 matrix. Furthermore, 
there exists no 2 x 2 matrix which anticommutes with <r. As a consequence of 
these commutation properties the solutions will not have a definite parity and thus 
they will become a linear combination of both spherical spinors x±w We try the 
ansatz (see also Exercise 14.3) 


= g(r)XK,n + i f(r)x-K,n (14.41) 

and then proceed to make use of the following relations, which have been derived 
in the context of the two-centre Dirac equation [see (10.2)ff., (10.37), (10.54)ff.]: 



, (14.42) 

V • Lx k ,h = -h(K + 1)Xk,h, , 

(14.43) 

= ~ X— 

(14.44) 

This finally leads to the differential equations for the radial functions g(r ) and/(r): 

d g po, «+l 

d r= h f r 9 ' 

(14.45) 

Cl LD- 

II 

^ 1 
1 

(14.46) 

These are the same differential equations as those we derived previously for the 
case of electrons in the presence of a constant, spherical-symmetric potential Vo, 
assuming that we set mo — 0 and Vo = 0. The solutions which are regular at r = 0 
can be directly taken from Exercise 9.5, yielding the results: 


(14.47) 

St 

II 

(14.48) 

Note that the wave functions with k 
dent. Instead, one has 

= |k| and k = — |k| are not linearly indepen- 

d) (+) = i d> {+) 

(14.49) 

As a consequence of the unique helicity of the massless, two-component particle 
there exist only half as many states in a four-component description. If we start 
with (14.1 1) for left-handed neutrinos, the angular momentum representation would 
read (see Exercise 14.3): 

= 9( r )Xn,n ~ j f{r)x-K,n ■ 

(14.50) 
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EXERCISE 


14.1 Dirac Equation for Neutrinos 

Problem. Solve the Dirac equation for neutrinos and determine the eigenvalues of 
the helicity operator and those of 75 for both energy solutions. Make use of the 
standard representation of the Dirac matrices 


P = 


a = 


0 & 

a 0 




Solution. Consider the Dirac equation for massless particles: 
-i he a • V^x) = H&ix) 

By means of the usual ansatz for the time evolution 
V = ^e-' lE, / h 
we obtain 

Ei\) — — i hedt • 'Vtp . 


( 1 ) 


( 2 ) 


( 3 ) 


The solution of this equation can be represented in terms of plane waves. Accord- 
ingly, the ansatz 

t/> = e' p ' x/h u(p) ( 4 ) 

yields 

Eu{p) = cot • pu(p) , ( 5 ) 

where 

E = ±E p = ±\p\c . (6) 

Using the standard representation of the Dirac matrices, one has j 5 = i7°7*7 2 7 3 , 


0 n 

11 0 


75 = 

and obviously 

75-S 1 = 


0 1 
n 0 


& 0 
0 <7 


0 <T 
& 0 


= a 


holds. Thus, the Hamiltonian in ( 1 ) can also be written as 
H — ca • p = C75 EJ • p . 


(V) 


( 8 ) 


( 9 ) 


The eigenfunctions of the Hamiltionian H are simultaneously eigenfunctions of 
the helicity operator £ • p/\p\ and 75. The solutions of the eigenvalue equation 
can be directly taken from the plane-wave solutions of the free Dirac equation (see 
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Chap. 2). We set m 0 — 0 and obtain four linearly independent solutions for the 
spinor u. Hereby the z axis is chosen as the direction of the momentum p. For u 
it follows that 

helicity : 


+1 -1 +1 -1 



. / °\ 


!\ 

/°\ 

0 


1 


0 

1 

1 


0 


-1 

l _ i 

0 


Sol V-i/, A 0 1 Vi/, 

V ^ V ^ 


positive energy negative energy 

whereby normalization factors have been suppressed. The eigenvalues of 75 are 
found to be 


E 

Helicity 

Eigenvalue 

+E p 

+1 

+ 1 

-\-E p 

-1 

-1 

-E P 

+ 1 

-1 

-E P 

-1 

+ 1 


Obviously 75 and the helicity operator have equal eigenvalues in the case of positive 
energy solutions. Opposite signs result for solutions with negative energy, and the 
eigenvalues of 75 are just the negative of the helicity eigenvalues. 


exercise 

14.2 CP as a Symmetry for the Dirac Neutrino 

Problem. Show that the product of charge conjugation and parity transformation 
represents a symmetry transformation of the Dirac neutrino. 


Solution. In the standard representation of the Dirac matrices 7^ 
conjugated spinor is found via 

the charge- 

A(x) = c 1 °r . 

(1) 

Hereby the charge-conjugation operator C satisfies the condition 


1 

II 

7 

0 

e- 

* 

0 

<0^ 

(2) 

together with 


C = i 7 2 7° = -C~ l = — C f = -C T . 

(3) 


By means of a unitary transformation U , which transforms the standard represen- 
tation of the 7 matrices into the one we have used for describing the neutrinos [see 
(14.24) and (14.25)], the corresponding representation of the charge-conjugation 
operator can be found. The unitary operator U reads 
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Using 


7s7s 


it follows that 

u= vi( 


n n 
n b 


o n 

-B 0 


(4) 


(5) 


( 6 ) 


Indeed, we obtain 

0 7 '0-‘ = 0-/ s u< = I 


_ 1 
~~ 2 


B B 
-B B 


B B 
B B 

-a. 


Oi 

Oi 


B 0 
0 -B 


0 

-Oi 


-B B 


-B 

-B 


07 

0 

0 

-Oj 

-B 

B 

0 

-B 


07 

0 


-B 

0 


-B 

B 

= 7‘ 


(7) 


= 7 


( 8 ) 


Now we transform the charge-conjugation operator C into the new representation: 


C = UC s U~ l 
Accordingly, from 

Cs = i7s 7s = 1 


we derive 

i 

= 2 


B 

-B 

II 

-B 


0 o 2 
-02 0 


0 

- 0 2 


-02 

-02 


-02 

0 

-02 

02 


0 

-B 


= i 


= l 


-B 

B 

-02 

0 


0 

-02 


-02 

0 


(9) 


( 10 ) 


0 

02 


(ID 


Performing a parity transformation of the Dirac spinors we obtain 

ipp(x') = e lv, 7sV'(-*,0 • (l 2 ) 

In the following considerations the phase factor e lcp does not play any role and 
will be omitted. Combining charge conjugation and parity transformation it clearly 
follows that 

ip C p(x,t) = CjsJs^i—x, t) = Cip*(—x,t) 

= i (~o 2 ■ <l3) 

Finally we check whether the spinor 'ipcp fulfills the Dirac equation. For this pur- 
pose we use the relation 


Exercise 14.2. 
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-& 2 0 
0 <72 


o i 0 
0 — < 7 , 

-&2 0 
0 < 7 2 

-&2 0 . 

0 


~0 2 0 
0 <72 

o* 0 
0 -< 7 * 


( 14 ) 


With x' — —x we derive 


d 


iH dt + J ^ cp<<x ’ ^ 

=i' 4 + “- v ') i (“o 2 

= 1 I _ o 2 ? ) - i hoc* . V, ) r(-x, t) 

= '(~ 0 2 ?) + 


,(-02 0 

0 < 7 2 


^i/i 4- i ha • V' ^ ip(x', t) 


= 0 . 


(15) 


In (15) the factor in brackets vanishes; thus the combination of charge conjugation 
and parity transformation is indeed a symmetry transformation. 


EXERCISE 


14.3 Solutions of the Weyl Equation with Good Angular Momentum 

Problem. Derive once more the angular momentum representation of the solutions 
of the Weyl equation (14.41) and (14.50). 


Solution. We start from the Dirac spinor in standard representation: 


, = ( 9(r ) Xk,» 

^ Vi/(r) x-w 


(1) 


The transformation to the new representation is achieved by the unitary matrix 

o = -L( 1 1 
V2V-1 1 

Furthermore, the neutrino states are generated from & = by 

■ 0 L = \ (H-75)^ = ^ (1 - 75) , 

V'R = 5(11 + 75)^ = f(H + 75) UlpK.Li ■ 


( 2 ) 


( 3 ) 
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- — f° 0 
“ y/2 V° 2 


1 1 

-1 1 


In view of 
(1-75)0 
one obtains 

^ = -L( 0 °V 5(r) 

VL V2V- 1 I/WO x-^ 

= {- 9 (r)Xn,n + if(r)x-K,u) 

and correspondingly it holds that 


a + * )& = ^(o 0 


1 1 

-1 1 


and thus 


J_ / ° ° 

V2V- 2 2 


- — (2 2 

“ V2 V° 0 


1 


= x-L) = ^ r ^ + '' fWx -^ ) 


( 4 ) 

( 5 ) 

( 6 ) 
( 7 ) 


Since the factors xjl and global signs can be absorbed within normalization or 
phase factors, they can be omitted. 


Exercise 14.3. 




15. Wave Equations for Particles with Arbitrary Spins 


15.1 Particles with Finite Mass 

Here we want to outline briefly how to construct wave functions which describe 
particles with spin s — 1 , | , . . . out of solutions of the Dirac equation and also 
to study by what kind of wave equation they are generated. As already seen in 
Chap. 6, the lower components of free solutions of the Dirac equation with positive 
energy vanish in the case mo ^ 0 in the rest system of the particles [cf. (6.13)]. 
Thus, for E p = moc 2 (which means p, = 0 when we are in the rest system) the 
spinor components are given by w^(0) = S ra and thus 

u4 +) = 0 , a = 3,4 . 

The superscript (+) denotes r = 1,2, which characterizes solutions of positive 
energy. The tensor product 

“ai.r =■ “4 +) «> )<4 +) (°) ' ' ' 05. D 

S * v* y 

2 s 

has 2s indices which are, at first, independent of each other. But considering the 
total symmetric part of this multispinor 

wSJ/..r =: E "2-w = "SE;* 0 ) < 15 - 2 > 

P 

(the summation index p denotes permutations of the indices), we find the following 
linearly independent combinations in the rest system: 

W S...r(°> * = °> = 6 Otl 6 0 l ... 6 v i 6 T i , 

^ T (0, i = 1) = 6 a 28/3i6yi . . . 

+ ^q:1^/32^71 • • • <5i/1<5t1 

+ <5al<5/31<!>72 . . . S u i 8 t \ + . . . , 

w S..r (°- i= 2 s) = 6 a 2 60 2 . . . 6 u 2 6 T 2 ■ (15.3) 

The i th multispinor has i indices equal to 2 and 2s — i indices equal to 1. Each of 
these multispinors represents an eigenvector of the operator of total spin, with in 
the rest system is defined by 
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2 ' oia.' ft' ...vv ’ tt 


— I fir 3 


...6 r 


+ \nzl T , 6 OtCt' 6/3/3' . . . 

6vv' 

(a, ol\ (3, (i'\ . . . v, v'\ r, r' = 1 , . . . , 4) 


where 


t 


3 

olol ' 


<t 3 0 

0 <73 


( 1 0 0 0 ^ 

0-10 0 
0 0 10 

\0 0 0 - 1 / 


(15.4) 


is the well-known four-dimensional Pauli matrix. Indeed, it is easily proven that 

ift£V +) (0, i ) = h(s - i)w (+) ( 0, o . (15.5) 

This is demonstrated in detail in Exercise 15.1. Since, apparently, i = 0, ... ,2s 
is valid, the number of eigenvectors is just 2s 4- 1, and according to (15.5) the 
eigenvalues of (h/2)E 3 are s, s — 1, . . ., —5 + 1, — 5 . This directly demonstrates that 
the symmetric multispinor (15.2) may indeed be interpreted as the wave function 
of a particle with spin s, where the z component can obviously assume 25 + 1 
different values. An analogous consideration allows the construction of solutions 
of negative energy. In this case the upper instead of the lower components have 
to vanish in the rest system. This means that we only have to replace the indices 
1, 2 in the Kronecker deltas in (15.3) by 3, 4 and the superscript (+) by (— ), and 
this will be verified in Exercise 15.3. 


exercise 

15.1 Eigenvalue Equation for Multispinors 

Problem. Verify that the multispinors u/ +) (0, i) fulfill the eigenvalue equation 


i/ii;V +) ( o, 

i) = 

h(s 

— i)uJ 

(+) (0 

Solution. Using 






n 

0 

0 

°\ 

ipecac' = ( 

0 

0 

-l 

0 

0 

1 

0 

0 


VO 

0 

0 - 

-\) 


a 

0 

0 

°\ 

= | 

0 

0 

-l 

0 

0 

1 

0 

0 

> 

^0 

0 

0 - 

-1/ 


and (15.4) we get 
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h&u>< + \o,i=o)) 

Z / aP... 


h 


y 3 

^ ft R> 


2 ^olol' PP' ...vv'tt ,uj a' P‘ 

2 ^ olol' ^ ol* p\ • • • 8 u \8 r \ 

h - 


OtP...T 


h , 


+ fiot\ T^^'pp'fip'lfi'yl • • • 8 v \8 r \ + . . . + 8ct\6p\ ... < 5^1 —E Tr ,8 T ‘ 

= h s (6 al s m . . . $ Tl ) = hsu™( o, i = 0) 


and analogously 


1 


ft£V +) ( o,i = i) 


Ocp...T 

= 2^ aa '^ a ' 2 ^^ i + S a 2^S00>Sf}>i ...6 T ] 

A ^ A ^ 

+ . . . + <5 a 2<5/31 • • • 1 + • • • <5rl 


h 


h 


+ ^al X^/3/3'^/3'2 • • • d T \ + • • • + ^al^/32 • • • 1 + • • • 


2s — 1 


fialbtn ■ ■ ■ d T \ + ^a2^0\ ■ ■ ■ S T l + ■ ■ ■ + <5a2^31 ...S T \ 


+ fialfi/32 ■ ■ ■ &t\ ~ &al $02 ■ ■ ■ d T \ + ■ • • + 6a\&02 ■ . ■ d T \ + • ■ • 

= H(S — 1) [<5 a 2<5/31 • • • <5 T 1 + ^al^/32 • • • ^rl + • • •] 

(+) . 
a/ 3 ...r 


= h{s-\)u ( ^ T (0,i = l) 


(1) 


( 2 ) 


%E 3 u<+\0,i = 2s)) 

1 / CXP...T 

f\> ^ ^ /j, A ^ 

= 2 ^aa'^a'2^/32 . . . <5 T 2 + ^a2 ^ ^/j'^/3'2 • • • &r2 

+ - - - + 6 a 28 02 . . . — E\ T ,8 T '2 
= -hs(j ( ^ T (0, i = 2s) . 


( 3 ) 


Hence the action of (E 3 ) aa l ...TT' on the multispinors w (+, (0, i) is clarified, and the 
validity of the eigenvalue equation (15.5) is proven. 


Exercise 15.1. 
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EXERCISE 


15.2 Multispinor as Eigenvector of S 2 

Problem. Show by using the example of the multispinor a/ +) ( 0, i = 0) that it is 
an eigenvector of (ft 2 /4)l7^ a ,^, TT , with the eigenvalue h 2 s(s + 1 ). 

Solution. The vector operator E is defined in a similar manner to E 3 in (15.4); 
then, we have 

+ }*.«■ •£«■ + ••• + 1^0,0,- ■ t„. 

+ • E aot f + + . . . + • Ej T t' 


+ \EJrr' • EJc 


+ a£tt’ • + . . . 4- Z^T 


( 1 ) 


Here, the notation has been simplified by omitting the delta functions with respect 
to the not explicitly listed indices in each term of the sum [cf. (15.4)]. On the rhs 
of (1) we see 2s rows with 2s factors in each individual term. The quadratic terms 
can be easily calculated, e.g. 


iw = (£')L' + (£ 2 )L' + (£ 3 )L' = 3<5, 


Jota.' 


( 2 ) 


A similar expression is valid for every pair of indices 00', 77 ', . . . , rr'. Since each 
row of ( 1 ) contains just one of these quadratic terms, the first contribution to the 
expectation value of (1) is given by (3/4)2 s = ( 3 / 2 ) 5 . Just (25 — 1) mixed terms 
of the form 




( 3 ) 


A A 

remain in each of the 2s rows, where £ and £ act onto different indices because 
of the total symmetry of u^} T (0, i) in a, 0, . . . ,t. By considering the example 
u/ +) ( 0 , i = 0 ) it is most easily demonstrated that 

\ £ • £'u (+ \0, i = 0) = ±u/ +) (0, i = 0) . (4) 

Inserting u/ +) (0, i — 0) from the first equation of (15.3) and remembering that 

<7 0 


£ = 


0 


it follows in the representation of the Pauli matrices [see (1.65)] that 

‘ £( 3 / 3 '') ^a' l^/3'l ■ • • 

= 3 (■^’Ia'^/3' + + ^aa'^/3') <Sa'l<Vl • • • 

= 3 (^21 ^21 ^£* 2^/32 + £\\ ‘^21 ^ a 2^/32 + £\\ £ n ^ al ^/ 3 \ ) • • • 


((+1)<5 q 2<5/?2 + (— 1)<5c* 2<5/32 + (+l)<5al<5^l) • • • 






( 5 ) 
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In (1), each term of the form (3) yields the contribution 1/4 to the eigenvalue, and 
altogether, we obtain 

(\E 2 ) J + \0, i = 0) = [§* + 1(2 5 - 1 ) 2 *] a/ +) (0, i = 0) 

= *(* + l)w (+) (0,t =0) . (6) 

Multiplication by h 2 yields the desired result. Similar, but more involved, is the 
recalculation that the remaining a/ +) (0, i) of (15.3) satisfy the same eigenvalue 
equation. Together with (15.5), this proves that the 2s + 1 multispinors lJ 1+) (0, i ) 
of (15.3) describe particles with spin s. 


exercise 

15.3 Multispinor of Negative Energy 

Problem. Construct - analogously to (15.3) - the multispinor of negative energy 
in the rest system and, based on this, the eigenvalue equation analogous to (15.5). 

Solution. It follows immediately from (6.3) that, analogously to (15.3), 

^/. T (0 , i = 0) = 6 a3 603 ...6 t3 , 

U al3...T (0) 1 = 0) = <5(*4<5/33 • • • <5x3 + < 5a3<5/34<5 7 3 • • • <5x3 + . . . + <5 Q 3<5/33 • • • <5x4 > 

* = 2s ) = SaASfH • • • <5x4 . (1) 

Since 

/I 00 0\ 

ri 3 Jo 1 0 0 

0 0 10 

\0 0 0 -1 / 

the action of E 3 on T (0, i) is the same as the action of E 3 on T (0, i). 
For i — 0 this means that 

{\hE 3 J-\ 0, i = O)) a0 T = \h [E 3 aa ,s a , 3 6 P3 ...s t3 + b a3 El0,b 0 , 3 . . . <5 t3 

+ . . . + < 5 a 3 < 5/33 • • • ^tt'^t'3] 

= hs6 a3 6p 3 . . . 6 t3 = fou^ T ( 0, i=0) . 

The further calculation is analogous to that presented in Exercise 15.1. 


Now we can transform these multispinors into an arbitrary frame of reference, 
bearing in mind the work of Chap. 6, particularly (6.30) and Exercise 6.1. By 
“boosting” all factors of (15.2) at the same time, i.e. applying the operator S(p ) 


>a.a.' (3(3' 




Exercise 15.2. 


(15.6) 
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we get 

J +) (p, i) = S o/ +) (0, 0 , u w (p 1 i) = s(|)u H (0,i) .(15.7) 

(In Exercise 15.4 these spinors uj (+) (p, i ) will be explicitly calculated.) Now every 
wave function may be written as a superposition of plane waves: 

!^+U^;p,0 = ^ T (p.0e-®- x/A , 

o = ^(p, 0 e+ipx//i (i5.8) 

and thus 

^ap.Mx) = ^2 f c (+) (p,i)'P^ T (x-,p,i)d i p 

i ** 

+ f c (-) (p,i ) ^ T (x;p,i)d 3 p . (15.9) 

i 

Hence these multispinors fulfill in coordinate space the following Dirac equation 
for each index, separately: 

(i% • d - mc) aa , 'Pa'p.Ax) = o , 

(ifry • d - moc) TT , ^ a p... T ‘{.x) = 0 . (15.10) 

They are named the “Bargmann-Wigner equations”, after their inventors. 1 Of 
course each component is, in accordance with the properties of solutions of the 
Dirac equation, also a solution of the Klein-Gordon equation 

/ m 2 c 2 \ 

[D + j ^ a ..Ax) = 0 . (15.11) 

The quantities ^ a p...r( x ) can obviously be regarded as the components of the 
wave function of a particle with mass mo and spin s , that is composed of elemen- 
tary identical spin-^ fields, since there exist exactly (2s + 1) linearly independent 
components, each of which obeys (15.10) and (15.11). According to (15.5), each 
of these components is an eigenstate of £ 3 . 


exercise 

15.4 Construction of the Spinor T (p,i) 

Problem. Determine from (15.6) and the results of Exercise 6.1 the spinor 
ui al l T (lV)» Usin § the °P erator 

Sac*' /3/3' ... tt ' (p) ; 

hence deduce the result 

0 / - mc)aa'U < £ ) i3... T (P, 0 = 0 /- m O c bp , ^a0'...T^P^ 0 

= (i/ - moc) TT >(J+l' T ,(p, 0 = 0 . (1) 

1 V. Bargmann, E. Wigner: Proc. Nat. Sci. (USA) 34, 211 (1948). 
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Solution. In the standard representation (3.13) we obtain according to (6.32) the 
four-component Dirac spinors of momentum p by using the Lorentz transformation 


0J T (P) = S (-f)^ T (0) T = 1,2 , 

where, according to Exercise 6.1, 

*(-!)■ 


( 2 ) 


IE + m 0 c 2 
2 m 0 c 2 

( i 

0 
PzC 


0 

1 

P-c 


PzC 


P-c 


E + moc 2 

P+c 


E + moc 2 
~PzC 


E + niQC 2 

P+c 

E 4- triQC 2 
1 


E + moc 2 E + moc 2 


0 


E + moc 2 
~PzC 

E + moc 2 
0 

1 


( 3 ) 


/ 


Hence the two solutions of positive energy with spin up and down are the first 
two columns of matrix (3). However, in contrast to (15.1) it holds that 


U) 


(+) 

a(3 


... r (jM)^0 for a, r = 3,4 


( 4 ) 


Using (15.6), (15.7) and (15.1) we obtain 

,(+) 


UJ 


aj3 


UJ 


(+) 

ol(3 


(P,i 

= 0) 

= S\ 

( P' 
C Ej 

I., 4 ' 

< p' 

~ E , 

>„■ 

..S( 

< P\ 

V EJ rl 

(P,i 

= 1) 

= 5 | 

< p ' 

< E> 

L*' 

f P 
K E, 

>„■ 

,. 5 ( 

' P\ 

< E)t\ 


UJ 


(+) / 
ap...r ' 


s (- 


S ( 

'-£) .. 

J | 

( P' 

) 

V 

EJ a 1 


. EJ0 2 


V E* 

' Tl 

s (- 


5 ( 

' p\ 

.S I 

( P' 

) 


EJa 1 


. E)p\" 

\ 

K E, 

' r2 

(_£ 

) 

c 

(_P\ 




V E 

) Oil 

. j 

V EJt 2 




..,r ‘ 

‘scans’ 

’* the columns of matrix (3), 


( 5 ) 


posed of the solutions u T (p) of (6.33a), i.e. 
0 1 - m 0 c)uj T (p) = 0 t = 1,2 . 


( 6 ) 


Exercise 15.4. 


Hence (1) holds. 
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EXERCISE 


15.5 The Bargmann-Wigner Equations 


Problem. Show that the spinors (15.9) obey the Dirac equation in the form (15.10). 


Solution. We use the Dirac equation (6.33a) in the momentum representation for 
the four-component spinors co T (p), that is 

0/ - £ r m 0 c)u) T (p) = 0 for r = l,2,3,4 (1) 


and the “plane waves” (6.31) 
ipp(x) = u > T (p)e- i£ 'P x / h . 

Now the Dirac equation in coordinate representation reads 
(i/ry^cP - m 0 c) xp r p (x) = - m 0 c) uj r (p)e~ l£rP ' x/h = 0 

Each linear combination of these solutions 

with arbitrary functions c r (p) obeys the Dirac equation, too: 
(iky^ ~ m o c ) 

= Ysj ( 2 ^) 3 / 2 ' Cr(p) - m ° C ) 

= f (2 vK) 3/2 Cr(p) ~ m o c ) u r (p)e~' erP ' x/h 

= 0 . 


( 2 ) 

(3) 

(4) 


(5) 


For an arbitrary Bargmann-Wigner multispinor 


^ ' oc 0... t ( x ) 


= ?/ 


d 3 p 


(2ttH) 3 / 2 L 


uo 


(+) 
a.(3. ..r 


(p,t)e 


-ipx/h (+) 


ip 3) 


+ w L/3 ) ..r(P> , ‘)e 1/,J:/ V >(p,i) 


ipx/h (-)/ 


( 6 ) 


we obtain 

(: i^T nd p - moc) aa , ^ a ^... T (x) 
d 3 P 


-II 


(In ft) 3 / 2 


( 7 nP* - m 0 c ) aa , w S...r(P> Oe tpx/h c i+) (p, i) 


(7 nP* + "k>c) aa , w i a >l... T (P, i)e ,p x/h c { \p, i ) 


(7) 


etc. for all the other indices. However according to the construction of r (p, i) 
(cf. Exercises 15.2 and 15.3), it holds that 
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{W* - m <>c) QQ , u%„ T (p,i) 

= ([/ MOC)aa'Sa'oc" 00 ' ...tt' (^7) ^a" 0' *') 

= 0 . (8) 

Taking into consideration the relation 

u r (e r p) = S (-f^)a/(0) (9) 

[cf. (2) of Exercise 6.6] one gets, furthermore, that 
{luP^ + m 0 c) aa , T (p,i) 

— iff ^ 0 C^aa’ Sa'a" 00' ...tt' ”0'...t'^’>^ 

= 0 . ( 10 ) 

Of course, we obtain analogous results for the other indices /?,..., r, and hence 
we obtain 


(i - m 0 c) aa , $oc'0 1 ..Ax) = 0 (11) 

and, analogously, also 

(i/i7 M d p - mc)^, 'P a 0' y ... T (x) = 0 , 

(i hrtpd* 1 - m 0 c) TT , H' a 0 y ... T '(x) = 0 . (12) 

These are the so-called Bargmann-Wigner equations. 


Exercise 15.5. 


15.2 Massless Particles 

We have to modify the derivation of the Bargmann-Wigner equations for massless 
particles, because we cannot find a rest system for mo = 0. However, we can 
choose the z axis colinear to the direction of momentum: 

p» = (p°,0Ap) ■ (15-12) 

The Bargmann-Wigner equations (i.e. for each index the Dirac equations) in mo- 
mentum representation read 

P (±7’ - ,</>i - 0 , 

p(±V-7 3 ) tt ,w$. t ,<W=° , 05.13) 

where ± again denotes the sign of the energy. Multiplying these equations by j° 
yields 
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hW) aa , = & aa >U$l..r(P) = ±U*l. T (P) . 

(tV) tt , 4> T ,(p) EE &tt'<J£1' T .(p) = ±wg.» , (15.14) 

because 7 ° 7 ° = H. Neither in the standard representation (3.13) nor in the Majorana 
representation (cf. Exercise 5.2) of the Clifford algebra is the matrix 7 ° 7 3 diagonal. 
But, using a unitary transformation, we can find a new basis so that 


(15.15) 

This representation is called the chiral or Weyl representation, because the 
chirality operator 7 5 is diagonal in this particular representation (which will be 
shown in the next exercise). The name “chirality” will be justified in the following 
text. 




EXERCISE 


15.6 7 Matrices in the Weyl Representation 

Problem. Show that the 7 matrices in the Weyl representation obey the common 
commutation relations. 


Solution. In standard representation [cf. (3.13)] 


7o S 


n 0 
0 n 


75 = 17 V 7V 


0 11 

n 0 


(i) 


where the upper index S stands for “standard representation”, (5.9) yields 

7M + 7 5 S 7m= 0 > (2) 

and (3.11) gives 

7^7o +7o7^ = 2p M0 H • (3) 

In the Weyl representation the matrices 7 , remain unchanged, only 75 and 70 are 
exchanged: 


„w 


7 / =7/ 


w 

7o 


-7f 


7 5 W = 7o 


(4) 


Because of (2) and (3) the anticommutation relations (3.11) are not changed by 
this transformation. The transformation is explicitly given by 

7?=S^ 


with 


S 


72 


n -n\ 
n 1 ) 


(5) 
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In the Weyl representation the operator on the lhs of the Bargmann-Wigner 
equations becomes diagonal so that the eigenfunctions may be determined conve- 
niently. Before proceeding to do so, we want to emphasize a remarkable property 
of the zero-mass Dirac equation. Due to the absence of the mass term an addi- 
tional symmetry arises: for any solution uj(p) there is always the spinor 7 5 u(p) 
simultaneously eigenfunction of the equation, since the matrix 7 5 commutes with 
the operator a 3 = 7 ° 7 3 appearing in the equation of motion (15.14) and two com- 
muting operators can simultaneously be diagonalized. Therefore the solutions of 
the Bargmann-Wigner equations may be classified by the quantum numbers of 7 5 
( chirality ): it is convenient to employ the chiral representation, where 


75 = i7o7i7273 = 


n 0 
0 -n 


(15.16) 


Obviously the chirality operator is already in diagonal form (which explains 
the name of the representation). With respect to a chosen inertial system, we define 
the chirality of the solution to be positive if the eigenfunctions satisfy 


75 OJ ^ ip ) = (+l)u^V) 


(15.17) 


where the subscript “+” designates positive chirality. With this notation, the so- 
lution u+(p), for example, refers to a state with positive energy [superscript (+)] 
and positive chirality (subscript +). Hence, the spinor uj^(p) must have the form 


/«?,• 


^(p) 


(p)\ 
w®<P) 

0 

0 


u {±) (p) 

0 


(15.18) 


and, on the other hand, solutions with negative chirality must satisfy 


7 5 o£ t) 0>) = (-l^V) 


(15.19) 


i.e. they will be of the form 

/ 0 \ 

OJ^ip) = 


0 

0 

,<±>r 


wl. 3 (p) 

V 5W 


The functions 


« (± > =: 


(J 


UJ 


(±) 

+1 

(±) 

+2 


0 

v {±) (p) 


=: 




u 


,(±) 

-3 

,(±) 


(15.20) 


(15.21) 


just defined are now two-component spinors. Consequently, the Dirac equation 
(15.14) splits into two-component equations, namely 


o 3 u = ± 


a 3 v = =F^ (±) 


(15.22) 
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Since there exists only two eigenfunctions of <j 3 , we conclude that v {±) (p) = 
an identification that implies significant consequences, as we shall see. 
However, the eigenfunctions of 



correspond to the eigenvalues ±1, i.e. for positive (negative) signs of the energy 
only the upper (lower) components of the two-spinor w (±) do not vanish. That 
means, in this case the spin orientation must be parallel (antiparallel) to the direc- 
tion of momentum. In contrast, the spinor v (±) has exactly the opposite property: 
for positive (negative) energy only the lower (upper) component is non-vanishing 
and the spin orientation is antiparallel (parallel) to the momentum, which, by defi- 
nition, is assumed to point along the z axis [see (15.12)]. In other words: zero-mass 
fermions (e.g. neutrinos) with definite chirality have the property that the helicity 
& • p/\p\ of the particle depends on the sign of its energy. In the case (15.17), 
zero-mass particles with positive energy have positive helicity, whereas zero-mass 
particles with negative energy carry negative helicity, and this partly explains the 
name chirality (“screw sense”). If we interpret the properties just explained in 
terms of the hole theory (cf. Chap. 12), where a wave function of negative energy 
and momentum — p corresponds to an antifermion with positive energy and mo- 
mentum +p, one observes that zero-mass fermions and anti-zero-mass fermions 
with positive chirality both also have positive helicity (they are “right-handed”). 
Similarly, fermions and antifermions with negative chirality both carry negative he- 
licity (they are “left-handed”). With this insight, the term “chirality” now becomes 
evident: chirality for zero-mass particles is equivalent to their helicity. 

In analogy to the wave functions for particles with finite mass, one may now 
proceed to construct a zero-mass, symmetric Bargmann-Wigner multispinor from 
the zero-mass Dirac solution in the chiral representation [cf. (15.1-15.3)]. However, 
since we employed a unitary transformation to pass to the chiral representation, now 
all the components with indices 1 and 4 in (15.3) contribute to wave functions with 
positive energy, which is also reflected by (15.22). Due to the chiral symmetry of 
the particular components, we can now search for eigenfunctions of the Bargmann- 
Wigner equations for each component that is simultaneously an eigenfunction of 
7 5 , i.e. with u a p... T (p) being a solution, 'y 5 aal tv a 't3...r(p), 7^/^'...r(p), etc. are 
solutions too. Taking into considerations the symmetry of the Bargmann-Wigner 
multispinors, it follows that the number of possible solutions reduces from 2s l 
to only two, since for positive energy and positive chirality there are merely 

W + + a/?...rO>) ~ <5al<5/31 • • • <5 t 1 • (15.23a) 

Similarly, for positive energy and negative chirality, 

~ «a4^4 • • • <5x4 , (15.23b) 

and, as before, the subscripts “+”, ” designate the chirality, and the superscript 

(+) denotes the positive energy. 

These are the two eigenfunctions of the operator for the total helicity that cor- 
respond to the two extreme eigenvalues ±s . For negative energy solutions the con- 
clusions are completely analogous. Hence, the wave functions may be transformed 



15.3 Spin-1 Fields for Particles with Finite Mass: Proca Equations 


359 


to any Lorentz frame and subsequently be re-expressed in terms of plane waves 
to construct arbitrary wave packets in configuration space as linear combinations 
which satisfy the Bargmann-Wigner equations in configurations space. 

Unfortunately, the outlined procedure for constructing the Bargmann-Wigner 
fields for particles with arbitrary masses does not allow us to find a Lagrange 
formulation, which is of fundamental importance for the quantization of the theory. 
To infer a Lagrangian density, one needs to transform the general Bargmann- 
Wigner equations for each particular spin orientation separately in a skillful way. 
In the following we will illustrate this concept for the case of spin-1 fields. 


15.3 Spin-1 Fields for Particles with Finite Mass: 

Proca Equations 

In this case the Bargmann-Wigner field is labelled by two indices. The two Dirac 
equations for the symmetric matrix ^^(jt) may be written as follows: 

(ifry^ - m 0 c) &(x) - 0 , &(x) (ifryJ<9 M ~ m 0 c) = 0 (15.24) 

or, in detail 

i}hd^ at - mocSaa') Paipix) = 0 , (15.25a) 

d^i ~ mocSpp') = 0 . (15.25b) 

Since the 4 x 4 spinor is symmetric, it may be expanded in terms of a complete 
set of symmetrical elements of the Clifford algebra standard representation, the 
latter consisting of ten symmetrical matrices 

7 , (15.26) 

where C = i 7 2 7 ° is the charge conjugation matrix that was defined in (12.23) and 
satisfies (12.24). The remaining six matrices 

7 m 7 sC , i 75 C , C (15.27) 

are antisymmetric; thus it holds on the one hand that 

(rc) T = C T 7 mT = -C (y° 7^7°)* 

— — 7 ^C , (15.28a) 

and on the other hand, we have 

(i7 5 C) T = C T i7j = -i7 2 7°(+i75) = -iys C ■ (15.28b) 

Therefore, we define 

*(x) = ^ 7 »C<P»(x) + ^CG^ix) , (15.29) 

where the coefficients x ) and G^(x) are generally complex and transform under 
Lorentz transformations like a vector and an antisymmetrical tensor, respectively. 
The Bargmann-Wigner equations (15.24) now become 
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(i/j.7 • d - m 0 c ) f^mocy^(x) + C = 0 , 


+ ^a^G^ix)^ C (i h-y 1 ■ d - m 0 c) = 0 . (15.30a) 

With regard to all indices, we explicitly rewrite the first of the above equations, 
which, for example, reads 

(ifrr • 9 - moc) aa , Qw 0 c(7 M C) a , / 3^(x) + ^(o^C^G^x)^ = 0 , 

(ihj ■ d - m 0 c )/3/3' ^m 0 c(7 /i C) a/3 ^ M (x) + ^(<v,C) Q ^G^(x)j = 0 . 

(15.30b) 


It now becomes obvious that the coefficients (p^(x) and G >llJ (x) do not take 
part in the matrix multiplication, and therefore they may equally be placed between 
the matrices 7 M and C or between a^ u and C, as has been done in (15. 30a, b). 

Now, in the above representation of C it holds that C 7^ = ~7 M C. Thus, we 
can factorize C out of the equations: 

(imocd a (£ M (x)7 a 7 M - m^j^ix) + \ih'y a a^ v d oc G tlu (x) 

-jmoca^G^ix)) C = 0 , 

(im 0 ca /1 ^(x)7 a 7 // + m$c 2 7 M < / Ax) + ^iha^j a d a G^(x) 

+\m 0 co^G^ l/ (x)) C = 0 . (15.31) 

Using the relation (cf. Exercise 3.2) 

[7°, = 2i (g a » Y ~ 9 au 7 m ) , (15.32) 

it follows for the difference of the two equations (15.31) that 

m 0 c (d°V" - d»ip a - G a »)o ail C - 2 7 „C (hd a G a » + = 0 . 

(15.33) 

The coefficients of the linearly independent matrices Co a/ g and 7 must 
vanish separately. Hence, for mo ^ 0, this implies that 


G**" = & l (p v - d v ip li , 


d^G^ = -^t 2 - u 


h 2 


(15.34) 


(15.35) 


These are the so-called Proca equations. 2 Expressed in terms of the vector field 
ip** they have the form 


2„2 


□<^-^(^)+^0^ = O 


(15.36) 


from which the subsidiary condition 


2 A. Proca: Le Journal de Physique et le Radium 7, 347 (1936). 
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= 0 (15.37) 

follows directly. Thereby we then obtain 

(□+^ = 0 , (15.37a) 

d ^ = 0 . (15.37b) 

Just as in (15.34) and (15.35), these are again the Proca equations, but now solely 
obtained using the potentials <p M . The appertaining Lagrangian density is 

£p.oc = - ■ G-. rav - 

, (15.38) 

and we shall further comment on this in Exercise 15.10. 

Note that from (15.34) an identity follows in form of the homogeneous equation 

deG^ + d»G ve + d u G B > 1 = 0 , (15.39) 

which can be formulated by means of the “dual” tensor 

= . i e ^ Ba G ec , (15.40) 

also as 

d^G^ = 0 . (15.41) 


15.4 Kemmer Equaton 


The Proca equations can also be cast in another linearized form. For that purpose 
we introduce the ten-dimensional “spinor” 


X 


Xi 


. Xio - 


(15.42) 


Its components are connected with the Proca fields as follows: 


Xi 

X3 

X5 




-01 


-03 


yjm^c 
1 _ 


13 


y/m 0 c 


Jmoc j 
X7 - - — <P 


X2 = — 


X4 = - 


X6 = ~ 


X8 = - 


yjm^c 

1 


V me 

l 

y/moc 

y/moc 


G 02 

G 23 

G 12 


X9 


V>noC 3 


Xio = — i 


y/moc o 


h 




(15.43) 
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Then the equations of the Proca theory (15.34) and (15.35) can be written as 


(ih(3 fl d fi - m 0 c) x(x) = 0 


(15.44) 


where the 10 x 10 matrices can be chosen as follows: 



( 

0 

0 

-n 

o f \ 

1 

/ 0 

0 

0 

o 

II 


0 

-n 

0 

0 

0 

0 

o f 

o f 

?*• 

II 

0 

0 

0 

—S k 

5* 

0 

\ 

V 

0 

0 

0 

0 J 

V iK k 

0 

0 


'£’j 

0 f 

0 / 


(15.45) 


with k = 1,2,3. The relation (15.44) represents a set of 10 differential equations, 
which are known as the Kemmer equations} The elements of the 10 x 10 matrices 
/3 M are given by the matrices 


/0 0 0\ /I 0 0\ 

0= 0 0 0 , 11= 0 1 0 , 

Vo 00/ Vo 0 \) 

/0 0 0 \ / 0 0 1\ 

S 1 = i I 0 0 -l) , S 2 = i I 0 0 0), S 3 =i 

Vo 1 0 / V-l 0 0/ 

K' = (1 0 0) , K 2 = (0 1 0) , K 3 = (0 0 1) , 

0 = (0 0 0) . 


0-10 
1 0 0 

0 0 0 


(15.46) 


We convince ourselves of the equivalence of (15.44) with (15.34) and (15.35) by 
showing that the ten equations do indeed coincide. For example, the first line of 
(15.44) yields 


iy/moc (d°ip l — d l ip° — G 01 ) = 0 , 


i.e. just the /j, = 0, v = 1 component of the Proca equation (15.34). As a second 
example we consider the tenth line of (15.44): 

2 

ih(d\Go\ + ^2^02 + 53 G 03 ) H j — = 0 . 

a 

Because of the antisymmetry of G^ u it can be also written as 

9 1 G 10 + a 2 G 20 + ftG 30 + ^ V ° = 0 , 

nr 

which is just the v = 0 component of the inhomogeneous Proca equation (15.35). 
The remaining components of (15.34) and (15.35) can be verified in an analogous 
manner. Now the above defined (3 matrices fulfill the “commutation relations” 

+ (3 y (3 x (3^ = + g vX ^ (15.47) 


(as can be shown easily by calculation, cf. Exercise 15.7), and these define the 
so-called Kemmer algebra. If one passes to another representation (3 f = Sf3S t 

3 N. Kemmer: Proc. Roy. Soc. A 177, 9 (1939). Sometimes the name Duffin-Kemmer equa- 
tion or Duffin-Kemmer-Petiau equation is used in the literature. As is often the case in 
the history of physics, several researchers almost simultaneously studied the same topic, 
here the spin-1 fields. 
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of the 0 matrices by a unitary transformation S, then of course, formally neither 
the commutation relations (15.47) nor the equation of motion (15.44) for the field 
x' — Sx (called the free Kemmer equation ) change. Obviously it is fully equiv- 
alent to the Proca equation, and this is also the case with regard to the coupling 
of spin-1 mesons to the electromagnetic field . 4 * * Therefore in the chosen standard 
representation (15.46) the reverse transformation of the Kemmer theory into the 
Proca form can be constructed. If we define with 

t/ M : - - ( ft ) 2 ( ft ) 2 ( ft ) 2 ( ft ^° - 9 M °) , 

: - U ii p u = -U v/i (15.48) 


ten additional 10 x 10 matrices, and with 


/°\ 


E = 


! 


a 10 -component spinor, we obtain 

<p v (x) = —£L=E'U v x(x) 

y/m 0 c 


(15.49) 


(15.50a) 


G» v {x) = ^nwE'U^xix) • (15.50b) 

If we pass over to another representation ft = SfiS\ then 

U'=SU U S' , =SU M „S j , E' = SE (15.51) 

holds accordingly. In other words: the Kemmer equations cannot only be repre- 
sented in the form (15.44), (15.46) but also in many other equivalent representa- 
tions. (In the following we shall follow these ideas further.) Also in the Kemmer 
form the spin- 1 fields posses a Lagrangian density of the form 

^Kemmer = - m 0 CXX , (15.52) 

where one defines with 77 = 20$ — 11 

X = X f »7 (15.53a) 


and 


- d M 


(15.53b) 


4 This was shown by Max Riedel in his diploma thesis (Frankfurt University, 1979) who also 

discussed other spin-1 theories and their relations to the Proca/Kemmer theory. Tragically, 

he died soon after its completion. 
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15.5 The Maxwell Equations 

If the mass of the Bargmann-Wigner particles vanishes, we have to take into 
consideration the chiral symmetry if we want to deduce a theory which is analogous 
to the Proca equation; therefore we have to decompose the fields into eigensolutions 
of the chirality operator 75 for each index (remember the explanations in Sect. 15.2). 
This can be realized formally by requiring that for the corresponding symmetric 
4x4 matrix it holds that 

i/ry • d(a + f3~fs)\P(x) =0 , 

!f r (x)(a + /?7 5 ) i/i7 T • d = 0 , (15.54) 

and that for arbitrary numbers a and (3, if \P(x) is additionally an eigenfunction to 
75 with the eigenvalues +1, then (a + (3~fs) reduces to the number ( a ± (3) and 
(15.54) reduces to the Bargmann-Wigner equations with vanishing mass (15.14). 


Now the ansatz for the solution, in contrast to (15.29), can only read 

&(x) = , (15.55) 

since 7 5 a^ u C is certainly symmetric, though not so, however, 7 5 7 /i C. From both 
equations for & it then follows that 

\\h la (a + £75) u^d a G^{x)C = 0 , 

\iha fiU (a + /3'y 5 )'y a d a G >lu (x)C =0 . (15.56) 

Because of 

75^" - \xe? v °°& aa (15.57) 

(see Exercise 15.20) and (15.40), this can be written as 

^ih'y s v (lL ' + ^f3d 6 s ae ^G,^ C 

= ^7*^" {ad s G^ + i pd s G^) C = 0 . (15.58a) 

Analogously the transposed equation reads 

( ad S G» V + Wd s G^) C — 0 , (15.58b) 

and the difference reduces to 

-2 h (ad^G^ + iPd^G^) 7„C — 0 . (15.59) 

The coefficients of the linearly independent matrices 7 must all vanish, in this 
case, additionally, for arbitrary numbers a, (3. If we set a = 1, f3 = 0 or a = 0, 
f3 = 1, eight Maxwell equations result: 

d^G^ = 0 , (15.60a) 

d^G^ = 0 . (15.60b) 


The homogeneous Maxwell equation (15.60b), which according to (15.39) reads 
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d e G^ + QH G »e + ff'Qen. = o (15.61) 

can be fulfilled identically with the ansatz 

G M " = S'V' - S'V'* ; (15.62) 

however, we find no way to fulfill the auxiliary condition (15.37b) as we did for the 
Proca field. The field ^ /t has been introduced here just as a supplementary quantity 
and has no physical meaning, this supplementary character being emphasized by 
the fact that its choice is not unique. The so called re-gauging 

<£V(*) -> v'pix) = - d^Aix) (15.63) 

with an arbitrary scalar field A(x) leaves the physical field unchanged, as long 
as the very general condition 

(d^dy - d^dfj) A(x) = 0 

holds. The variety of fields ip ,l (x) can be reduced if we define additional conditions 
like (15.37b), which select a specific gauge. Equation (15.37b), which plays an 
important role in electrodynamics, is called the Lorentz gauge. But using this gauge, 
the field <^ M (x) is still not uniquely characterized, because we can define a new 
field (p ,fi such that 

= 0 (15.64) 

for the case CM = 0 ! A different choice of gauge would be to demand <p° = 0 in 
addition to (15.37b) ( radiation gauge), but obviously this gauge does not lead to 
a unique choice of the vector field tp^, either. 

In the following exercise and examples we will further deepen our understand- 
ing concerning the various ideas presented here. 


exercise 

15.7 Commutation Relation of Kemmer Matrices 

Problem. Verify the commutation relation (15.47) for the Kemmer matrices (15.45). 

Solution. We present the solution for three examples: 

(a) All indices are equal (p = v = A = 0). 

(b) Two indices are equal (p = u = 1, A = 2). 

(c) All indices are different (p — 0, v = 2, A = 3). 
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(a) (/j, = v = A = 0): First we calculate 


pP(3° = (f3 0 ) 2 = 



0 

0 

-H 

(A 



0 

0 

-n 

0 + \ 


0 

0 

0 

o f 



0 

0 

0 

o* 


-n 

0 

0 

o f 



-n 

0 

0 

0* 


0 

0 

0 

0 I 



0 

0 

0 

0 / 


/n 0 0 o f 

' 0 0 0 0 f 


n o f 


Vo o o o 


and then 


/ 


( 3 ° (^°) 2 = 


-B 0 f \ 
0 0 f 
0 0 f 


/ n 0 0 o f \ 
1 0 0 0 0 f ’ 


0 0 / 
-n o f \ 
0 0 + 

0 0* 

0 0 ) 


0 


n o t 


Vo o o o / 


= P° 


0 

0 0 
-1 0 
V o o 
/ 0 0 
0 0 
-n 0 
\ o o 

Thus we get 

/? 0 / 3 °/?° + / 3 °/ 3 °/ 5 ° = 0 ° + = g 00 /? 0 + g m / 3 ° 

(b) (n = v = 1, A = 2): Once again we begin with the product 


/3 2 /? 1 = 


/ 0 0 

0 /s: 2 t\ 

/ 0 

0 

0 



’ 0 0 

—iS 2 0 f 

0 

0 

-iS 1 

o f 

0 iS 2 

0 0 f 

0 

iS 1 

0 


o f 

\K 2 0 

0 0 / 

W 1 

0 

0 

0 / 

K 2 ^ ® AT 1 

0 

0 

0 f 

\ 



0 

-iS^S 1 

0 

0 f 




0 

0 — iS 2 iS* 

0 f 




0 

0 

0 

K 2 K^ ) 




( 1 ) 


( 2 ) 


( 3 ) 


( 4 ) 


Here products of the matrices S' from (15.46) occur, especially 
/ 0 0 1 \ /0 0 0 \ /0 1 0 \ 

iS 2 iS ! = 0 0 0 0 0 -1 =10 0 0) 

V-l 0 0/ Vo 1 0/ Vo 0 0/ 

= M {n) = K^®K 2 , (5) 

where “<8>” denotes the tensor product of the vectors and K 2 . With the thus 
defined 3x3 matrices M (J ’\ we now have 
( 0 0 0 AT lf \ 


/ 3 l (/A 1 ) = 


0 0 

0 iS 1 

\k' 0 


-i S 1 

0 

0 


o f 

o f 

0 ) 


{ 


0 

0 




0 

0 

-iSMf< 12 > 

0 


/a/ (21) 
0 
0 

\ 0 
0 


0 

-M (12) 

0 

0 


0 

0 


0 f \ 


0 f \ 

is 1 ^ 12 ) o f ' 


o f 

_M(12) Q f 

0 0 ) 


0 

0 


0 f 

o y 


( 6 ) 
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and hence 


/0 0 O' 

K l M m = (1 0 0 ) ( 1 0 0 

Vo 0 0, 


- 0 


( 7 ) 


and 


= - 



( 8 ) 


The rhs of (6) contains the 10 x 10 zero matrix O\ 0x 10 ; hence the final results reads 


0 W + Z ? 1 /? 2 /? 1 = 0 , 0 X 10 = S 12 /? 1 + 9 n {? . 

(c) /j, = 0, v = 2, A = 3: Evaluating the product of the /? matrices 


/ ® K 2 


?(3 2 = 


V 


o 


0 

-iSHS 2 

0 

0 


0 

0 

-iS_ 3 iS 2 

0 


0 T \ 
o f 
o f 
0 ) 


the products of the 3 x 3 matrices 

/0 -1 0\ / 0 0 1 

1 0 0 0 0 0 

Vo 0 0 / V-l 0 0 

= m (23) = k 2 ' 


iS 3 iS 2 = 



occur. Because 

/ 0 0 1W0 -1 0 

0 0 0 111 0 0 

V-l 0 0/ Vo 0 0 

= M (32) = <g> K 2 = M (23)t 


iS 2 iS 3 - 



holds, we obtain 

Z? 0 /? 3 ^ 2 + Z? 2 Z? 3 /3° 



0 0 

-n o f \ 


/ M (32) 


0 



0 

6 f \ 



0 0 

0 o f 


0 

-m (23) 


0 

o f 


- 

-1 0 

0 0 + 


0 


0 


- 

-Af (23) 

o f 



0 0 

0 0/ 


l 0 


0 



0 

o / 



m (23) 

0 

0 

o f \ 


( 

0 

0 - 

-n o f \ 


0 

—M (32) 

0 

o f 



0 

0 

0 0 


+ 

0 

0 

- 

-m (32) 

o f 


- 

-n 

0 

0 0 



o 

0 

0 

0/ 

\ 

0 

0 

0 0/ 


0 

0 

-m (32) 

0 


0 m ( 23) o f \ 
0 0 0 f ' 


o f 


0 

Vo o o o ) 
= Oioxio = 5 03 /? 2 + 5 2 V 3 • 


+ 


/ 0 0 

0 0 

M® 2) 0 

\o o 


-a/ ( 23) o f \ 


0 

0 

0 


o f 
o f 
0 / 


( 9 ) 


( 10 ) 


( 11 ) 


( 12 ) 


Exercise 15.7. 


( 13 ) 
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EXERCISE 


15.8 Properties of the Kemmer Equation Under Lorentz Transformation 


Problem. Discuss the properties of the Kemmer equation under Lorentz transfor- 
mations. 


Solution. From our discussion in Chap. 3, we know that there has to be an explicit 
prescription which allows for an observer A to recalculate a wave function xOO of 
the Kemmer field into a wave function x'( x> ) which is the wave function observed 
by B, who is at rest in another inertial frame. Following the relativity principle, 
x'{ x> ) is the solution of an equation, which is of the form 

(i/^d' M - m 0 c) xV) = 0 (1) 

in the system of observer B, if x(*) is a solution of the Kemmer equation 
(15.44). Because the Kemmer matrices ft must also obey the commutation re- 
lations (15.47), from which they are defined up to a unitary transformation, we can 
write without loss of generality: 

& = h • ( 2 ) 

Covariance of the Kemmer equation means that simultaneously (1) and (15.44) 
hold, where primed quantities refer to the inertial system of observer B, which is 
connected to the inertial system of A via the Lorentz transformation 


X f — CL V X — CL U $ 

A /i, > U fi a M U u ? 

[see (3.1), (3.3)] where a is an orthogonal matrix [see (3.4)]. 
connecting x with x we write 

xV) = 5(d)x(x) , 


( 3 ) 

As an ansatz for 

( 4 ) 


and vice versa [see (3.27-28)]: 

X{x) = S-\a) X \x') . (5) 

Inserting (5) into (15.44) and after multiplication with S(a ) one gets 

(ihS(a)l3 fl S- 1 (a)d^ - m^c) X \x') - 0 , (6) 

and then, because of (3), for the primed system B it holds that 

(ihS(a)f3 li S~\a)a li u d ,l/ ~m 0 c) X > (x , ) = 0 . (7) 

In order that (7) is identical with (1), 

5 _1 («)/V^) = (8) 


must hold, and from this equation S(a ) can be determined. To reach that goal we 
start, similary to the procedure described for the Dirac field in (3.36-3.68), with 
the infinitesimal proper Lorentz transformation 
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“I - , (9) 

where 

Aujpv <C 1 • 

Because of the orthogonality (3.36a) of the Lorentz transformation, 

= —Auivu -\- O (10) 

holds. Expanding S(a) up to first order in gives 

S{a) = l- l -Au tJL J» L ' (11a) 

or 

S- 1 (a) = 5(a" , )= (lib) 

with the unknown 10 x 10 matrices / ^ (H is the 10 x 10 unit matrix). As in the 
case of the Dirac field a formally similar calculation yields 

[P\i^]_ =-2i(s A 'V'-<7 A V) ■ 02) 

An additional determining equation for the generators / ^ follows from the relation 

S(a)S(a')S-\a) = S(aa'a-') , (13) 

which states that S(a ) is a representation of the Lorentz group on the vector space 
spanned by the Kemmer algebra. For infinitesimal Lorentz transformations it fol- 
lows that 

[1^, I aP ] _ =2i {g ua I^ - + g^I™ - g^l v P) . (14) 

The relation (12) and (14) are fulfilled by the antisymmetric matrices 

7 MI/ = 2i (/J^/? 1 ' — /?"/? M ) , (15) 

which can be easily verified by use of the commutation relations (15.47). This result 
occurs in direct analogy to the matrix o tiU in the Dirac theory [see (3.42)]. The fac- 
tor “2” in (15) gives rise to the fact that the Kemmer field transforms to itself under 
a spatial rotation of 27 t, and not Air, as is the case for the Dirac field (c. f. Chap. 3). 
The problem being solved for infinitesimal proper Lorentz transformations, we now 
look for the finite transformations which can be constructed by repeated application 
of the infinitesimal Lorentz transformations. With Alj^ = it follows that 

( i ui )xl ' \ N 

'"ii r ? »-j 

= exp V'*) = Siayxil) . ( 16 ) 

With this the transformation properties of the Kemmer field x( x ) under proper 
Lorentz transformations are known. Now we concentrate upon the improper 
Lorentz transformations a ,n \ which can always be written as the product 


Exercise 15.8. 
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= b t * a a a ‘ / , 

where the matrix b^ u describes either space or time inversion, or both, i. e. 

b=p , t , pt . 

In this connection 


P*u 


/I 0 0 O' 

0-100 
0 0-10 
Vo o o -l. 


= 9^ 


[cf. (4.2)] and 


/-I 0 0 O' 

' 0 1 0 0 

0 0 10 

V o o o i 


= -<r 


(17) 


(18) 


(19) 


( 20 ) 


Since the spinors x(x) lie within the representation space of the Lorentz group, 


then 


S0) = S(ba) = S(b)S(a) (21) 

must hold again, i. e. it is sufficient for us to find S(p) and S(t). All other transfor- 
mations are then calculable as products. Since p and t are discrete transformations, 
they cannot be obtained from infinitesimal transformations and we must directly 
start with (8): 

S- l (p)/3^(p)^p^P, = Y,9 tl, 'P ,/ • ( 22 ) 

V 

By means of the Kemmer algebra (15.47) and r\ defined in (15.53a), one verifies 

that 


77/^77 = (23) 

V 

is true and with that we obtain the parity operator of the Kemmer theory: 

P =S(p) = r]e l,p , (24) 


with arbitrary <p e R. 

In the case of time inversion (cf. Chap. 12) we must start with the Kemmer 
equation with minimal coupling, thus with 


'ihp* (dp + i e -A^ 


— nioc 


X(x) = 0 


(25) 


We designate the time-inverted wave function Xt( x ') and require it to obey (25) 
in the transformed coordinate system; thus 




- ni§c 


XT (x') = 0 


(26) 
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In this connection, because of (20) and (12.41a) and (12.52,53), 

^ = t lk v d v , (27) 

4, = -t/A, , (28) 

The additional sign in (28) can be cancelled by complex conjugation. Thus (25) 
implies that 


-ih ^ - m ° c = 0 


(29) 


If we define the matrix t by 

XT (x') = t X \x) , (30) 

this implies that 

[-ih (< d ; + i eA’p) - moc ] X t(x') = 0 . (31) 

which coincides with (26) if 

tfltt- 1 = -tSPp (32) 

is true [in complete analogy to (12.56)] for the Dirac matrices. Now in the repre- 
sentation (15.47) 


0; = , 03 ) 

holds, which can easily be checked, i. e. we can choose 

t = \e v ‘ , (34) 


with arbitrary <p' € R. The time-reversed wave function of the Kemmer theory is 
thus (up to an arbitrary phase) given by 

Xt(x') = x*(x) • (35) 


Exercise 15.8. 


EXERCISE 

15.9 Verification of the Kemmer Algebra 

Problem. Show that the Kemmer algebra (15.47) can be realized by the (16 x 16) 
matrices 


( 1 ) 
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Solution. Instead of (1) one often writes in an offhand manner 

^ = \ (tm + y M ) , (2) 

where besides 

{7m > 7m} = 2<7m v = { 7^.7^} i (3) 

also 

h^y ]_ =o 

must be required, too. [Strictly speaking = 7 ^ 0 D and 7 ^ = 7 M 0 1, where H 
is the unit matrix of the Clifford algebra (3) in four dimensions.] Now, because of 
(2), we have 

PxPufiv = | (7a7m7m + Jxl'Wu + 7a 7m 7m + 7a7{,7m 

+7a7m7m + ixitfl'v + 7a7m7m + 7a 7m 7m) • (4) 

Adding PuP^Px to this product results in 

PxPuPv + PuPvlPx = g {ixlulv + Ivlulx + jWn.ll + jWWx 
+ 7a {7,L< 7m} + 7a {7m> 7m} + 7m {7a, 7m } 

+7m{7a.7m}+7m{7a,7m} + 7m{7a.7m» , (5) 

and since 

JxJhJu = {7a , 7m I 7m - 7m {7a , 7m } + {7m . 7m } 7a - 7m7m7a (6) 

is valid, as well as the corresponding terms for the product jW/jJI’ t00, we obtain 
[because of (3)] 

PxWv + PuP,iPx = g (4 pa m 7m + ^9hvJx + 4 5am7m + 4 0mm7a) 

= 9Xfj.Pi/ T 9/ji/Px , (7) 

and thus (15.47). 


EXERCISE 


15.10 Verification of the Proca Equations from the Lagrange Density 

Problem. Show that the Lagrange density £p r0 ca from (15.38) leads to the Proca 
equations (15.34) and (15.35) and their complex conjugates. 


Solution. In this notation of the Lagrange density the fields G >1V , G M "*, and 
are conceived as independent. A variation with respect to G* u yields then 
because of the definition (15.34) of G 


dC 

dG 


;mm 


1 


& , <p v - d v p tx = 0 


\LV 


Variation with respect to tp* results in 


dC 


du- 


dC 


2„2 


m 0 C 

h 2 


W + ^<9„(G Ml/ - G l ' M ) - 0 


dpi d(d u tp*) 

i. e. the equation of motion (12.35). The complex-conjugate equations are obtained 
analogously by variation with respect to G^u and tp^. 
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EXERCISE 


15.11 Conserved Current of Vector Fields 

Problem. Determine the conserved current for the vector fields. 

Solution. The Proca equation (15.35) holds for the field the corresponding 
complex conjugate equation 


2.2 


d»G 


liv 


mfic 

~~h? 


-<Pv 


( 1 ) 


holding for the field G* u . If we multiply (15.35) by p* and (1) by ip v , then the 
result is 


(2a) 

(2b) 


= 

ITIqC 2 


K 2 

<pud»G; u = 

m^c 2 

v V( p\ 

h 2 


Subtracting (2a) from (2b), we obviously get 
-^^ + ^8^=0 . 

This can be written in the form 

3 ^ = 0 . 

where the conserved current is given by 

h = ( P VG tv ~ Outfit • 

The reason is that the terms in braces in 


( 3 ) 


( 4 ) 


( 5 ) 


3% = ^d^G; u - (c^G/) <pl + {(0V)G*„ - G^d^*} 

equal zero because of the antisymmetry of G ^ and G*„, which follows from the 
definition (15.34), i.e. 

d^^c; u - G^av* = 5 [O'V - - G^(d^ u * - av*)] 

= \ [G^C;, - G^G^*] = 0 . (6) 


exercise 

15.12 Lorentz Covariance of Vector Field Theory 

Problem. Investigate the Lorentz covariance of the vector field theory. 

Solution. In order for the Proca equations to transform covariantly under Lorentz 
transformations, the field must transform like a vector, i. e. under the infinites- 
imal Lorentz transformation [cf. (3.1) and (3.53)] 

x 'ti = a » X v = , 


( 1 ) 
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it follows that 

= A^x) + Alj/A v (x) 


( 2 ) 


must hold, too. On the other hand the Lorentz transformation of a field can generally 
be written [in analogy to (3.27) of (3.39)] in the infinitesimal case as 


<(*') = + 


ol(3 




(3) 


Each of the matrices must be antisymmetric, because the Auj^ u are antisym- 
metric too [cf. (3.36)]. The comparison of (2) and (3) yields at once that 


- a “ g g\) 


' ol(3 

For pure rotations, for instance, one obtains 

/ 0 +1 O' 

fi n )ki = Wk9 2 i — g'ig 2 k) = ( — l o o 

0 0 0. 

0 O' 

0 -1 

+ 1 0, 

/ 0 0 - 1 ' 

(^ 31 ) t; = - {g 3 kg l i — g 3 ig'k) = ( oo o 

V+l 0 0 , 


(4) 


\ 

(i 23 )k, = -(s 2 kg i l -g 2 I g\) = ( 0 

Vo 


(5) 


The indices k and l here take only the values 1,2,3 and these matrices can be 
conceived as the spin-1 analogues of the Pauli matrices [see (1.65)]. 


EXAMPLE 


15.13 Maxwell-Similar Form of the Vector Fields 


The additional condition (15.37) of the Proca theory implies that the four- 
components of are not independent dynamic variables. To isolate the inde- 
pendent quantities, we introduce the three-dimensional fields: 


B'=~k £ ijk Gjk /,_/,* = 1,2,3 


E' = -G 0i = G'° 
so that we can write 

0 —E l 

I 


E' 

E 2 

Ve 3 


0 

fi 3 

-B 2 


-E 2 

-B 3 

0 

B l 


- £3 \ 
B 2 

-B l 

0 1 


= -G v » 


In this notation the Proca equations (15.34) read: 

b‘ = -y*G jk 

= -\e iik {d m - d k pj) = (V x tpf , 


(la) 

(lb) 


( 2 ) 


(3a) 


15.5 The Maxwell Equations 


375 


E‘ = — G 0 ' = -dPtpi + &<p° = 


1 d_ 
c dt 


< p + 


Using the inversion of (la), 

-e^B, = y kl e lmn G mn = \ {<f m g k n - g k m g i n ) G mn 
= \{G jk -G kj ) = G ik , 


(3b) 


( 4 ) 


one gets the following relation: 


d,G ,J = d'£ iik B k = -e jtk diB k = (V x By 


doG® = . 

c at 

Therefore the spatial part of (15.35), with v = j, is given by 


1 d 


2J2 


mfic 




■<P 


The temporal part, however, (u — 0) reads as 


(5a) 

(5b) 


( 6 ) 


diG i0 = V x E = . (7) 

Equations (3b) and (6) are the true equations of motion, because they contain the 
time derivations of <9 0 ip and d 0 E , whereas (3a) and (7) can be seen as definitions 
of the fields B and ipo, respectively, expressed by the independent fields (p and E. 
Substituting the dependent fields into the equations of motion by their definitions, 
one obtains from (3b) and (6) that 


- = -E + -fr V(V • E) , (8a) 

c at m^c 1 

1 d ^ mlc 1 ^ 

c9? £ = + ir V + VX(VXV> 

= (-V 2 + <f + V(V x If) (8b) 

In the last conversion we used the relation 

[^7 X (V X tp)]i — ZijkdjSjclmdlPm = (^li^mj ~ &lj 

= did m tp m - d,d m = [V(V • tp) - vVli . (9) 


Example 15.13. 


exercise 

15.14 Plane Waves for the Proca Equation 

Problem. Find the solution of the Proca equation (15.36), with the additional con- 
ditions (15.35) and (15.37) of the form 

(i. e. plane waves) where N p is a normalizing factor. 


( 1 ) 
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Solution. First of all, from (15.36) and (15.37) it follows that 

(p 2 - mlc 2 ) e£ = 0 . ( 2 ) 

This means the functions can only be different from zero if 

P° = ±X ~ e p > (3a) 

E p = +c^Jp 2 + m$c 2 . (3b) 

Additionally, condition (15.35) implies that 

= ° • (4) 


Now for every three-vector p and each sign of p°, a set of three linearly independent 
four- vectors e^ x (A = 1,2,3) can be constructed which fulfil (4). Let £\ (A = 
1,2,3) be an arbitrary “tripod” with 


ex ■ ey = &XX' 


(5) 


which, in the rest system, is obviously valid. Now we Lorentz transform to a system 
which moves with the velocity -p/p° = - v/c . Then the three-vectors £\ convert 
to the vectors £^ x with the components 




1 


\/l — V 2 jc 2 C 

e p \ = + 

— £ x + 


V p-£ X 

— ' £x — 

mo c 


y/l — v 2 jc 2 — 1 (v 


v 2 /c 2 
(p ■ £x)p 


l (V \ V 

-{c- e *)7 


moc(p° + m 0 c ) 


( 6 ) 


These of course obey the covariance condition 


P-£ p x =0 


(7) 


because the latter does not depend on the reference system and in the rest system 
p = 0 and 0 ,a = Furthermore the following orthogonality is valid: 


£p\ ’ £p\' 

= (P ' £ x)(P ' ey) 
mfic 2 

(P ■ £x)(p ■ ey) 


ex + 


mlc 2 
= ~£x ■ ey + 

= —f>XX’ , 


ex • ey + 


p(p • £ a) 
moc(p° + moc) 
2(e A • p)(e x 


ey + 
P) 


p(p-ey) 
moc(p° + m 0 c) 

P 2 (P ' e\)(p ■ £y) 
m 0 c(p° + m 0 c ) ' mlc 2 (p° + m 0 c ) 2 _ 


+ 


(p ■ £ X )(p ■ £y) 


mlc 2 


1 - 


2m<)C 


P - m 0 c 


p° + m 0 c p° + m 0 c J 


because the bracket yields 

p° + m 0 c - 2 m 0 c - p° + m 0 c 
p° + m 0 c 


( 8 ) 
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Equation (8) is evident in the rest system, and, due to the invariance of the four- 
scalar product under Lorentz transformation, the explicit calculation of (8) was 
not actually necessary. Together with the time-like vector the three space-like 
vectors e^ )X constitute an orthonormal quadrupod in Minkowski space. 


EXERCISE 

15.15 Transformation from the Kemmer to the Proca Representation 

Problem. Verify the reverse transformation (15.50) from the Kemmer to the Proca 
representation. 


Solution. The matrices U ^ and U^ v in our representation are given by 10 x 10 
matrices: 


U° = 


U k = 


u k0 = 


u kl = 


/0 0 0 0 \ 

0 0 0 0 
000 OM ’ 
Vo o o - 1 / 

/ 0 0 0 oj\ 

0 0 0 0 

0 0 0 O' 

Vo 0 -i K k 0 / 

/ 0 0 0 0j\ 

0 0 0 0 f 
0 0 0 0 f 
Vi K k 0 0 0 / 
/0 0 0 0 f \ 

0 0 0 O' 

0 0 0 0* , 

Vo i K k S l 0 0 / 


(la) 


(lb) 


(lc) 


(Id) 


Now, for example, we have 


i h 


x/m oc 


E^U° X = + 


i h 


y/m 0 c 


Xio = <P 


,o 


( 2 ) 


because the spinor E t of (15.49) picks out exactly the lowest component of U°x 
[using (15.42) and (15.43) in the process]. In the same way 


i h 


-.E^U'x = 


-Xi = V 


y/ rn 0 c x'moC 

etc. are valid. On the other hand, for example, 

v / m^c£' t f/ 10 x = iy/nwxi = G 01 


(3) 


(4) 


or 
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y/m^cE^U 23 x = i v / ™<^(iX 4 ) = G 23 


(5) 
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holds, because 

/0 -1 0 \ 

K 2 S 3 = i(0 1 0) 1 0 0 = i(l 0 0) = iK 1 , (6) 

VO 0 0/ 

etc. Now it is clear that the transformation (15.50) does give exactly the components 
(15.43), so that, indeed, (15.50) is the reverse transformation. 


EXERCISE 


15.16 Lagrange Density for Kemmer Theory 

Problem. Convince yourself of the correctness of the Lagrange density (15.52) of 
the Kemmer theory. 


Solution. It has to be verified that the Euler-Lagrange equations 
d£_ dC 
dx ^d(d^x) 

are equvalent to the Kemmer equations (15.44). Now with 
dC 1 

0= = ~ m 0 cx 

and 

dC 1 „ 

did 2 lhPflX ’ 
it follows immediately from (1) that 
(ihp^ - m 0 c) x = 0 , 

i. e. (15.44). Alternatively we may write: 
n _ dC dc 

9 X ^(d^x) 

_ 1 _ <- 1 _ 

= -mocx - ^hxPfid* 1 - d^-ihxP^ 

= ~X {woe + = 0 , 

the equation for the adjoint spinor x- 


( 1 ) 

( 2 ) 

( 3 ) 


( 4 ) 


example 

15.17 The Weinberg-Shay-Good Equations 

There is yet another six-dimensional spinor representation of the Proca theory, first 
given by Weinberg in 1964 and later investigated further by Shay and Good. 5 To 


5 D. Shay, R.H. Good: Phys. Rev. 179, 141 (1969) and S. Weinberg: Phys. Rev. 133, B1318 
(1964). 
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substantiate it one can proceed in a completely analogous way to that of the deriva- 
tion of the Kemmer equation. In order to do this we define the six-dimensional 
“spinor” 


X = 


'Xi 


' X6 • 


which is connected to the Proca field by 

*' = 3i( G23 - iG °') ' 

« = ^( G ' ! + iG ” 2 ) • 
«^( G,2 - iG ° 3 ) ■ 

^ = -4^( G23+ ' G °') ' 

*’ = ^( G ' 3 - iG02 > ’ 


X6 = -- 


1 


(G 12 + iG 03 ) 


4moC 

With the notation (cf. Exercise 15.13) of the field strengths 

£' = G'° , 

B, = e ,jk G ik 


equation (2) reads 
1 


Xi 


4moc 


(B 1 -iB 1 ) 


X4 = - 


Arrive 


(B 1 + if 1 ) 




X3 


1 


Arrive 


(B 3 - i£ 3 ) 


« = -4^( B2 + i£2 ) 
(B 3 + if 3 ) 


X6 = - 


4moc 


( 1 ) 


( 2 ) 


(3) 


(4) 


(cf. also Exercise 2.1, where the components X 4 > Xs> X6 are unnessary because the 
Maxwell field is real). The field equations are given by 

[^ u {\hd^)(ihd v ) - (ihd^)(ihd^) + 2mlc 2 ] \(x) 

= [ihd^ - gnit&u + 2 m 2 c 2 ] x = 0 , (5) 

where the 6x6 matrices are given by 


i 1 = 


0 

Sijl + MW+AfW 


Sij n + M (ij) + M {],) 


= i l 


7 °< = yo 


0 s‘ 

-S' 0 


7 


,oo 


0 11 

n 0 


( 6 ) 


Example 15.17. 
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The definitions of the 3x3 matrices S l , 0 and 1 can be taken from Ex- 

ercise 15.7 (commutation relation of the Kemmer matrices) and (15.46). That the 
Weinberg-Shay-Goou equations are equivalent to the Proca equations cannot be 
shown as easily as in the case of the Kemmer equations, because first one has 
to give a suitable linear combination of the six Shay-Good equations, which then 
lead to the Proca form. This algebra problem can be most easily presented by first 
finding the inverse transformation, which is then used to reconstruct the Proca field 
from the spinor y. 

Here we proceed in close analogy to the Kemmer theory. In the same manner 
as in (15.48) and (15.49), the antisymmetric 6x6 matrix U^ v is defined by 

U« = \et k B k , 

U 0J = \\Al , (7) 

where 


A' = 


B l = 


a/ po 


M (3i) -M (3/) 


( 8 ) 


are 6 x 6 matrices, too. t he six-component spinor h. is defined as 


/°\ 


E = 




Hence, analogously to (15.50), the following relation holds: 


(9) 


G M "(*) = — —E'U^xW ■ (10) 

2mnc 

Applying this matrix from the left to the Weinberg-Shay-Good equation (5) one 
has to take note of the fact that 


[/Miy*/? = - g av U^ + g /3fi U ai ' - g 0u U a ») (11) 

holds. Consequently we find that 

—E^U^ [i hd a ( 7 a/3 - g af3 ) i hdp + Im^c 2 ] x = 0 , (12) 

or 

— — \ihd a {g atl U 0v - g av U ^ + g^U av - g 0l/ U a ^) \hd B 1 \ 

2moc L ' 

= 2 ml^G^ . (13a) 

The lhs of this equation reads as 

i hd^ihdgG^ - ihd u ihdgG^ + ihcr\hd 0 G 0u - m u \hd 0 G^ 

= 2 itiqC 2 — d u (p tl ) , 


(13b) 
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defining the field by 

V? = — • O 4 ) 

ITIqC 1 

Equations (13) and (14) agree with both of the Proca equations (15.33) and (15.34). 
The Lagrangian of the Weinberg-Shay-Good theory is written as 

Avsg = h 1 (duX) (7 M " - 9 tiu ) + 2 me 2 XX , (15) 

where 


X =: X f 7 


00 


(16) 


and this is examined in more detail in Exercise 15.18. 

Besides the representation used here there is another interesting representation, 
in which the spinor x has a simple form. Defining the unitary matrix 


S = 


x/2 VI 



we get 


x! = s x = 


_v/2_ 

Aiuqc 



(17) 


(18) 


This spinor satisfies (5) if one inserts the transformed matrices 
= 57^51 . 


(19) 


In (10), naturally and E have to be substituted by and E'\ 
u'^ = SU^S* , E' = SE . 


EXERCISE 

15.18 Lagrangian Density 

for the Weinberg-Shay-Good Theory 

Problem. Prove that the Lagrangian of the Weinberg-Shay-Good theory [Exercise 
15.17, (15)] yields the proper equation of motion. 

Solution. With the help of the Euler-Lagrange equation we obtain 

0 = |= - 9 » Jif-, = 2m o c2 X ~ - 5 m ")^X (1) 

ox O(o nX) 

[cf. Exercise 15.17, (5)]. The adjoint equation is obtained accordingly as 

o /» rj /> 

0 = -5 — n = 2m o c2 X ~ rfd^xh^ ~ 9 M ") 

OX o(d^x) 


Example 15.17. 


( 2 ) 
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15.19 Coupling of Charged Vector Mesons 
to the Electromagnetic Field 


The complex spin-1 fields appearing in the Proca, Kemmer, Weinberg-Shay-Good 
equations describe charged vector mesons. To let the equations of motion remain 
invariant, if we apply a local gauge transformation to the fields, the derivation d y , 
must be substituted by the “gauge-invariant derivation” 

£ 

T i~^l^ > 

where e is the charge of the vector field [cf. (1.132-136)]. Therefore the field 
equations of the three different theories read: 


+ 


2 2 
m<j(r 


h 2 


ip 1 ' = 0 


(la) 


G* u/ =D*<p v -D v <pF , Proca 


(lb) 


(\hD^(3 ,Jj — moc) x( x ) = 0 , Kemmer 


( 2 ) 


[ft 2 Z) M ( - g^ u )D u - 2 m^c 2 ] x( x ) = 0 , Weinberg-Shay-Good . (3) 

Since observable massive charged mesons are not elementary particles, but com- 
posed of quarks, as are the nucleons, we expect an anomalous coupling [in the case 
of the proton (cf. Exercise 9.11)]. In the simplest cases it can appear in form of a 
dipole or quadrupole coupling like 

-^dipole ~ b * 1 F [11/ •> ^quadrupole ~ c^dxF^ . ( 4 ) 

The tensors b ^ and are bilinear forms in the vector fields formed with 
matrices of the algebra of the correspondling theory. 

In the case of free vector mesons we saw that the Kemmer, as well as the 
Weinberg-Shay-Good equations can be brought into Proca form. In doing this with 
minimal and anomalous couplings, the algebra becomes more involved. In these 
calculations one finds that the anomalous couplings in the Proca equations transform 
into the corresponding anomalous terms of the Kemmer equations and that the 
minimal coupling of the Proca theory is equivalent to that of the Kemmer theory. 
The Weinberg-Shay-Good theory exhibits different behaviour: after transformation 
into the Proca form the minimal coupling in £wsg creates an anomalous dipole 
moment. However, since the Proca theory cannot exclude an anomalous dipole 
moment, a measurable difference of the three theories cannot be found (at least in 
the first order of the coupling constant a = e 2 /hc). 
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EXERCISE 


15.20 A Useful Relation 


Problem. Prove the relation 

75 ^ u = \ie^a ae . 


Solution. To prove this relation we take the standard representation, for which it 
holds [cf. (2.13) as well as (3.57)] that 


75 = 


o n 
n o 


a* = if 0 *' 

a 1 0 


= £ ij k 


a k 0 
0 a k 


B = 1 0 ) = 5^* 


Hence we write 

-o/ • / & 0 


75 <7 = i 


0 a 1 


= 


fjLV J 


75 ct' 7 = e ijk ( ? M = -I£ ijk a 0k = +i £ ijk0 a k0 = -£ ijflu a^ . 

\<7k 0 J 2 

Since the transition to other representations does not change anything, the re- 
lation is thus proven in general. 


15.6 Spin- 1 Fields 

For simplicity we restrict ourselves to fields with finite mass. For s = § the 
Bargmann-Wigner multispinor has three indices, and it is totally symmetric with 
respect to these. The equations of motion (15.10) read as 

(ifry • 9 ~ m o c) aa > Vafa CO = 0 , (15.65a) 

(ifry • d - m o c) 0/3 , = 0 , (15.65b) 

(ifry • d - m 0 c) yy , ^yCO = 0 . (15.65c) 

We now try to expand the field in totally symmetric matrices similar to (15.29). 
Because of the total symmetry in the first two indices we make the ansatz [cf. 
(15.29)] 
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Here ^^(x) is a “vector spinor” (fi = 0, 1, 2, 3 is a Lorentz index, whereas 
7 = 1, 2, 3, 4 is a spinor index), transforms like the product of a co- 

variant antisymmetric tensor and a spinor. Symmetries in the indices /? and 7, and 
consequently also the symmetry with respect to a, /?, and 7, is guaranteed if the 
coefficients (15.26) of the expansion of the matrix in the basis of a Clifford 

algebra also vanish, as was the case in the expansion of One gets these 

coefficients by contracting (15.66), with respect to the indices /3 and 7, with the 




’-1 


matrices , ( C 475)^ and (C *757 M )/3 7 , which yields, for example, 




1 = ' 
moc 1 


h ( / y ti C) a 0C l3 ^ fi 1 (x) + 


^ ("7 n)ccy'lp ^ 1 7 CO T - 7 CO — 0 


In this way one gets the three constraint equations 


h 7m^ m CO + 2 0 >i/ ,mi '(a) = 0 , 

(15.67a) 

h lyfls^ix) + = 0 , 

(15.67b) 

7 ^7m757aV' m C0 + ^<3>7s7a V^CO = 0 , 

(15.67c) 


where we have not written out the spinor indices explicitly any more. Multiplying 
(15.67b) by 75 and adding and subtracting (15.67a), respectively, leads to two 
equivalent conditions: 

7 nVix) = 0 , (15.68a) 

VV^CO = 0 . (15.68b) 

Furthermore we multiply (15.67c) by 75 on the lhs and make use of the commu- 
tation relations (3.11) and the relation (15.32), which yields 

- a ~ 7a70^C0 + ^7a d^V^CO 

- KduXlv ~ 9u \l»)V u (x) = 0 . 

Because of (15.68) and the antisymmetry of ^"CO* (15.69) reduces to 
mac 


(15.69) 


h 


-V’aCO - O^VvaCO = 0 


(15.70) 


Since the condition (15.68b) follows from (15.70) and (15.68a), namely 

-^~7aV ,A (a) - i'yx'y^^ix) = 0 - ^(7a7m - 7m 7 a CO 

= &»x V^OO = 0 , 

we are left with the two independent conditions: 

7m^C0 = 0 , 


(15.71a) 


MqC 


V^CO = OaV^CO 


h 


(15.71b) 
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This corresponds to 4 + 16 = 20 linear conditions between the 16 + 24 = 40 
components of i/+ and U ;l ^‘ y . so that the number of independent conditions reduces 
to 20 , which is the correct number of components of totally symmetric tensor of 
the third rank in a four-dimensional vector space. From the Bargmann-Wigner 
equations (15.65a) and (15.65b) then follows: 

rriQC 

— (ih -7 • d - m 0 cW(7//CV /#%(-*) 

+ ^(ih .7 • d - moc^ia^C^pip^^ix) = 0 , (15.72a) 

^7^ (ifry • d - m 0 c) / 3 / 3'V ,, V x ) 

+ ^(^C) a/ 3 /(ih ,7 • d - = 0 . (15.72b) 

In (15.72b), again the products 7 M C 7 ^ and a^ v C^\ appear [cf. (15.30)]. Using 
the properties of C we rewrite (15.72) in the form 

~Y [(i h'-j u d u - m Q c) 'y fl C] a0 i> tJ ' y (x) 

+ \ [0%a5 a - mocJa^C}^^^) = 0 , (15.73a) 

- m 0 c)C] a0 ip^-yix) 

+ ^ [vfiAifrl \d x + m 0 c)C] a/3 ^yix) = 0 . (15.73b) 

The difference of (15.73a) and (15.73b) yields 

/v 2 m 2 c 2 

2m 0 c(a^ tl C) a/3 d u ’4) ,i y (x) ( 7 mC'WV’'V-* : ) 

- moc(<3>C) a/ 3 V+%(x) + ^ih(['yx,& fi v]C) af3 d x ip tll '~ Y (x) = 0 . 


Because of (15.32), this corresponds to 

^(<5v M <7)a,3 [9 ‘'^■yix) - + i)fU' 1 (x)] 

T lm 2 c 2 

-( 7 „C) a/3 -l-f 7 (r) + 29/ l ' 1 (r) =0 . (15.74) 

The coefficients of the linear basis matrices a^C and 7 must vanish. For 
mo 7 ^ 0 this yields similar relations as were found in (15.34) and (15.35) for the 
Proca fields: 

ip^(x) — d^ip^ix) — d v ^{x) , (15.75a) 

d^(x) = . (15.75b) 

Because of the antisymmetry of ip^ix), then from (15.75b) it again follows that 
d^^x) = 0 , (15.76) 

and therefore also that 

m 2 c 2 

a/>%(x) + -± r ip‘ t ^x) = 0 . (15.77) 



386 


15. Wave Equations for Particles with Arbitrary Spins 


Thus each spinor component of fulfils the Proca equation, and the tensor 

field x ) is uniquely determined by Correspondingly the third Bargmann- 
Wigner equation yields 

^■(7 tiC) a p [(ihry ■ d - m 0 c)ip^(x)\^ 

+ \(°^C) a0 [(ifvy ■ d - mo cW # "'(*)] 7 = 0 . (15.78) 

However, this is automatically fulfilled since [because of (15.71a)], (15.71b) and 
(15.75a) combine to form 

mocip^ = = Ki'yx (<9 A '0 m — = ih'y • d , 

which is just the Dirac equation for each Lorentz component of 'tjj^(x). Furthermore 
from (15.75) it also follows that 

(ifi/y • d - moc) ip^ix) = 0 , (15.79) 

so that the validity of (15.78) has been proven. Thus the equations of motion for 
the spin- 1 field reduce to the Dirac equation for the independent vector-spinor field 

(ih/y • d — mo^rp^ix) = 0 , (15.80a) 

together with the four constraints 

7^00 = 0 . (15.80b) 

The “gauge” condition (15.76) can be derived by multiplying (15.80a) with 77 to 
give 

0 = (ih'y^ u d u - m 0 c7 M ) x ) = 2ifr (g^ - d y ^(x) 

= 2 i hd^ix) , 

so that this is not an independent relation. The equations (15.80) are known as the 
Rarita-Schwinger equation. 6 They can also be combined into one single equation, 
namely 


[(ih'y ■ d - m 0 c)g^x ~ \iMrhidx + ^7a) + f 7*^7 • 9 + m 0 c)yx] 


x ip x (x) = 0 , 

(15.81) 

since multiplying this equation by 7 M yields 


[\\hdx + \moc 7 a) ip x (x) = 0 , 

(15.82a) 

whereas the application of d M results in 


[7 • d (jihdx + jmoc'fx) - m 0 dx] ip X (x) = 0 . 

(15.82b) 


Inserting (15.82a) into (15.82b), then because of mo ^ 0, it follows that 

dx4> X (x) = 0 , 


W. Rarita, J. Schwinger: Phys. Rev. 60, 61 (1941). 


6 
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and thus (15.76), and (15.82a) are equivalent to (15.71a), i.e. 

1 \i> X (x) = 0 . 

With these two conditions (15.81) immediately reduces to (15.80a). Combining the 
four four-spinors tp x (x ) into a 16-component spinor 


X = 



one can write (15.81) as 

( iha u d 1 ' — flmoc ) x(jc) = 0 


(15.83) 


where the 16 x 16 matrices (3 and have a Lorentz index as well as a spinor 
index: 


(M m a =: 7 v9n\ - f7/i(9i/A - 7*7 a) - f 7 a g^u , (15.84a) 

=■ 9»\ ~ j7m7a • (15.84b) 

Each element of these matrices with the indices fx, A is a 4 x 4 matrix in the spinor 
indices. Now one has the equation 

Pn\ {g x v - i x iu) = {g^ x - 7m 7 a ) P\u = g^ , (15.85) 

from which it follows that the matrix /3 has an inverse, 

(P~')nu=9^- 7m7. • 05.86) 


Using 

(■ r = (p la ^)fj,<r = (p x ) a 

= (9 » A - 7m7a) [lu9 X a - \l X (9vo - Ivla) ~ \l a9 X v] 

— r )v9fJicr 37 l*(9vcr ~ 7i^7cr) — j7 <^9^ 

~ "1“ 37^(sw ~ 7i'7cr) H“ 37/i7^7cr 

7 v9fJi<J "h 'yti9v(T ~ 7/i{7^j7<r} ~ 37 cr9jj>v 37)Li7^7cr 

= 1 v 9 ijl(j ~ 7/x5W — ^7 "1“ 37/i7i'7cr ? (15.87) 

Equation (15.83) takes the so-called Fierz-Pauli-Gupta form 

(i hr^ - m 0 c ) x(x) • (15.88) 

Here the matrices r ^ obey the commutation relations 

X ( - g^) A/ 1 , = 0 , (15.89) 

(P) 

where implies a sum over all possible permutations of /x, A and g. 
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Y. Takahashi: Introduction to Field Quantization (Pergamon Press, Oxford 1969) 

A.S. Wightman: Invariant Wave Equations; General Theory and Applications to the External 
Field Problem , Lecture Notes in Physics, Vol. 73 (Springer, Berlin, Heidelberg 1978) 

D. Lurie: Particles and Fields (Interscience, New York 1968) 
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16. Lorentz Invariance 

and Relativistic Symmetry Principles 


16.1 Orthogonal Transformations in Four Dimensions 


We consider the four-dimensional space with coordinates x fl (p — 0, 1, 2, 3), which 
- assuming the most general case - may be complex numbers. The absolute value 
of the position vector is given by (summation convention) 

5 = \J <J ^ijX^ X u = y/x fl X^ , (16.1) 

and the length s can also take complex values. Examining an orthogonal transfor- 
mation which relates each point with coordinates to new ones 

x w = a\x» , (16.2) 

the absolute value should remain unchanged by this transformation (this is the 
fundamental, defining condition for orthogonal transformations), i. e. 

s' = yjx'^x^ = y/x^xV - s . (16.3) 

From this relation x' l ‘x' fl = a fJ a a l , T x„x T = x a x a it directly follows that 

a ti (T a y T — 6 T a (cr, r = 0, 1,2,3) , (16.4) 

implying that the a generate a linear orthogonal transformation. Equivalently to 
(16.2) and (16.3), (16.4) can be envisaged as a defining equation for orthogonal 
transformations: Sequential application of two transformations gives 

x" v = a'V'" = a' W = f3\x a , (16.5) 

where the transformation 3 U „ again represents an orthogonal transformation, since 

£//?% = a'^a/a'^^Q - 8» £ a^a s e - 6% . (16.6) 

Thus the orthogonal transformations a" M form a group. The unit element of this 
group is given by a/ = 8^ v (the identity transformation) and the existence of 
the inverse elements follows from (16.4) (a a -1 = 11). Accordingly all conditions 
characterizing the group structure are fulfilled (the associative law is also valid 
because the transformations a u are linear and orthogonal). 

This group of homogeneous, linear transformations that keep the distance be- 
tween two points invariant is called a group of the four-dimensional, complex, or- 
thogonal transformations. In analogy to the three-dimensional group 0(3) we use 
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the compact notation 0(4) [more precisely, 0(4, C) because this group is defined 
over the field C of the complex numbers]. 

At first a^v has sixteen components; however, due to the ten orthogonality 
relations (16.1) only six independent (complex) parameters remain. Additional re- 
strictions for the a^y yield the subgroup of 0(4, C): Let us assume all x,_ t to be 
real, then the same must hold for all a ;/ . We are led to the real four-dimensional 
group 0(4,/?), whose group properties follow from those of 0(4, C). 

Furthermore, we may consider the case that the coordinates entering the scalar 
product of two vectors may occur with different sign, eg. 

d^ 2 = d t 2 - dx 2 = -dx 2 + d t 2 . 


The orthogonal group keeping this line element invariant is called 0(3, 1). (In 
general the group O (p, q) leaves a bilinear form . .-x 2 +x 2 ^, +. . ,+x 2 , 

invariant.) This non-symmetric expression for the line element can also be treated 
in the framework of 0(4, C), if one does choose the coordinates xi,X 2 ,x^ as real 
and the coordinate a'q = it as purely imaginary — the corresponding space being 
called Minkowski space. In that case the transformed coordinates must also 
have the same properties, which yields for the elements a 


Oil; 

«00 


(i,k = 1,2,3) 


real 


aoi , a,o (* = 1,2,3) imaginary 


(16.7) 


The group, whose elements are restricted by the conditions (16.7), is called the 
homogeneous Lorentz group L of the Minkowski space, and the elements are given 
by the Lorentz transformations. 

Further subgroups already known to us can be obtained by the restrictions a i0 = 
aoi = 0 and <%) = L The group with these properties represents the complex, three- 
dimensional group 0(3, C) [0(3) and 0(4) always means 0(3, C) and 0(4, C), 
respectively]. For real we are led to the real, three-dimensional group 0(3,/?). 
In the following we shall discuss the groups 0(4) and L in more detail. 


16.2 Infinitesimal Transformations 
and the Proper Subgroup of 0(4) 

First we write down an infinitesimal element of 0(4): 

«/ = <$/+£/ (k/l«:i) • (16.8) 

The transformation of x 1 ' then reads 

x>v = + e*V) , (16.9) 

from which, according to the orthogonality relations for a ", it follows that 

= 6% = (V + e /) {8\ + e\) 

= 6^6% + 8/e^ + e/6% + 0(e 2 ) 

= fi U <T + £ V O + So- 1 ' + 0(e 2 ) 
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In order to satisfy the orthogonality relations at least up to terms of the order 0(<s 2 ), 

(16.10) 

must be valid. Since 


= g t ' Q e 


(TQ 


— a VQ f 

— y ^Qcr 


we also obtain [in view of (16.3)] e QG — e aQ , i. e. the infinitesimal quantities 
have to be antisymmetric. In accordance with our result obtained previously, this 
implies that there exist six independent parameters of an infinitesimal transforma- 
tion. In the case of 0(4) the are arbitrary complex numbers, while they are 
all real for 0(4,/?). As already mentioned before, the group parameters of L e ik 
(i,k = 1,2,3) are real and £,o = £ 0 / are imaginary. 

In order to construct generators of 0(4) group transformations we rewrite (16.9) 
as 


x’" = {6\ + a s (irf)x» , (16.11) 

where a s denote the parameters of the transformation. In order to understand the 
meaning of the (Z)"^ let us have a look at the transformation of a vector with 
components A a \ 

A! e = D e a {oL,)A a . 


D - , 7 stands for the transformation matrix dependent on the parameter a, . Expansion 
of the matrix elements for small cc, yields 


A' e = 


( 4 *' + ? 


dP e 0 

dotj 


oa + 0(a 2 ) 


a=0 


(16.12) 


and comparison with (16.11) shows that 


(/) 


V (S) _ 
M 


da s ) a=0 


(16.13) 


is valid. Such a Taylor expansion about the identity transformation (a, — 0) is only 
possible if the D v ^ are continuously differential functions of the parameters a/; 
however, this is the essential condition for a group to be a Lie group, and since all 
groups considered in the following are Lie groups [Lorentz group, 0(4), 0(4, /?), 
. . .], the improper Lorentz transformations will be excluded from this discussion. 
In our case the parameters a s , stand for the infinitesimal quantities e Q a , which 
means that 5 is the short-hand notation for two indices (5 — > ga). Accordingly the 
infinitesimal operators (generators) (/)% have to be characterized not only by s, but 
by two indices g, c r. In order to avoid confusion and to distinguish between four- 
dimensional matrix indices and generator indices, we write the latter in brackets 1 
and rewrite (16.11) as 


1 Note, this notation appears somewhat different from the one used in Chap. 3 [e. g. (3.44)ff]. 

It is assumed that the reader will be able to establish the connection without major prob- 
lems. 
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x 


/V 


nf B) 

<*e 




(16.14) 


Since we sum over g and a, each term is counted twice. For this reason we have 
to multiply the sum by the factor 1/2. In view of the antisymmetry of e ga , we 

also choose (/%) to be antisymmetric with respect to the indices g, a. Parts 
symmetric in g, a would not contribute when contracting with the antisymmetric 
e ea . Thus we obtain six infinitesimal operators the generators of the 

orthogonal group 0(4). Comparing (16.14) with (16.9) we are led to a defining 
equation for /%: 

= e\ • (16.15) 

CFQ 


First we lower the tensor index u\ 

= g TV e\ = e. 


TfJ, } 


<J,Q 


or 


E _ (J — 

~£<7 Q\ l Tfl) ~ > 

(7,Q 

respectively. Since the sum over g and <j only contributes for g = fi and a = r, 
then it must terminate. Accordingly an ansatz for a solution of (16.15) must look 
similar to = g G Taking into account the antisymmetry with respect 

to (_ i , v, it follows that 

{Jnu) (ae) = g°»g e v - g°„g% = {-Kn) (<Je) , (16.16) 

and the antisymmetry with respect to a , g also becomes obvious: 
d \(<7e) /i \(q<? ) 

(V) =-(V) • (16.17) 

With the aid of the relation g» e = g IJU g QlJ = we can directly denote the matrix 
representation of the generators , e. g. 


(V) <10) = g'^v - gKg°n = 6 - 6 l „6° 



0 

-1 

0 

°\ 


+1 

0 

0 

0 


0 

0 

0 

0 


0 

0 

0 

0 / 


and for mixed tensors 


(K) m = g> T (lr„y u,> = 


( 10 ) 


( 


0 

1 

l ° 

V o 


1 o 0\ 
0 0 0 1 
0 0 0 
0 0 0 / 
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Furthermore, 


(° 

0 

0 

°\ 

(t 1 3 

(V) = 9 » 

1 9 

0 

0 

0 

+1 

II 

1 

0 

0 

0 

0 



Vo 

-1 

0 

0/ 



/° 

0 

0 

°\ 

/i x (2 1 ) 2 1 

(V) = 9 n 9 u 

9 1 

0 

0 

-1 

0 

1 

i 

ll 

0 

+1 

0 

0 



Vo 

0 

0 

0/ 



f ° 

0 

0 

°\ 

(j \ ( 32 ) 3 2 

(V) =9n9u 

- 9 3 u 9 2 u = 

0 

0 

0 

0 

0 

0 




Vo 

0 

+1 

0/ 



0 

0 

-1 

°\ 

(V) = 9 n 9 » 

? n 

0 

0 

0 

0 

1 

1 

< 

t 

ll 

+1 

0 

0 

0 



0 

0 

0 

0/ 



0 

0 

0 


(j \ ( 30 ) 3 o 

(V) =9 »9 V 

- 9 3 » g°n = 

0 

0 

0 

0 

0 

0 

0 

0 



V+i 

0 

0 

0/ 


Now the commutation relations of ( ^ can be easily verified: for example 

■(W 21) ,(W 32r 


(° 

0 

0 

°\ 

/° 

0 

0 

°v 


f° 

0 

0 

°\ 

0 

0 

-1 

0 


0 

0 

0 

0 


0 

0 

0 

0 

0 

+1 

0 

0 


0 

0 

0 

-1 


0 

0 

0 

-1 

Vo 

0 

0 

0/ 


Vo 

0 

+1 

0/ 


\o 

0 

+1 

0/ 


/O 0 0 0\ 

0 0-10 
0+1 0 0 
Vo o oo/ 


/0 0 0 

°\ 

/° 

0 

0 

°v 

/° 

0 

0 

°\ 

0 0 0 

-1 

0 

0 

0 

0 


0 

0 

0 

-1 

0 0 0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

Vo 0 0 

+ (W 3) 

0/ 

Vo 

-1 

0 

0 / 


Vo 

+1 

0 

0/ 


Analogously we get 

[(+) o2, .(vr]^= + (u c,) 


( 16 . 18 ) 
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;(w < “.(wn.-+(w 01 ’ . 

(un_=+(W l3) ■ (i6.i9) 

(There are nine further commutators left, but they appear rather seldom in appli- 
cations.) All fifteen commutation relations can be summarized within a single one 
(for the sake of more clarity we omit the tensor indices /i and v and note only the 
parameter indices a and o\ we also drop the brackets ( ) on the parameter indices) 

[ 7°^ = _,5 a 7//3<5 _|_ §oa 5jf3y £0~(ja6 _ £06 jay 

or, if the parameter indices are lowered, 

_ —QaylftS "F 9as'l 0y + 90y^ab ~ 9 06^ ay 


(16.20a) 

(16.20b) 


These equations represent the Lie algebra of the group 0(4). It is convenient to 
introduce six linearly independent combinations instead of l ae (k, l = 1, 2, 3) 
/''<+) = I (/« + /'°) ? 

/'•<-> = I , (16 2]) 


where i,k,l^ are cyclic permutations of 1,2,3. From (16.20a) the commutation 
relations of /'(+>,/'(-> are derived as 

[r (+ >, /«+>]_ = -/'<+) , 

[/''<->, /*(->]_ = -/'<-> , 

[/«■(+),/*(-)]_ =o , 

i,k,l =1,2,3 cyclic . (16.22) 

Since /' (+) and I' ( ~ ) are not hermitian, we introduce the hermitian operators 
y'(+) = -i/'(+) , /'•<-> = -!/'(-) . (16 .23) 

Then, it follows from (16.22) that 

[y f( + ) ,y* ( +>]_ =i/'<+» , 

=ii ,( - ) , 

[y'-(+),y‘(-)]_= o , 

i,k,l =1,2,3 cyclic . (16.24) 

By introducing 7' (±) we have constructed two 3x3 matrices from the 4 x 4 
matrices I ae . Consequently we have left four-dimensional space and moved to a 
three-dimensional subspace, where /' (±) are the infinitesimal generators. Therefore, 
each four-dimensional transformation can be performed by combining appropriate 
transformations on the subspaces. Because of (16.21), one has 
jki = jn+) + j i{ -) and p 0 = jn+ ) _ ;/(-) 

i,k, l 1,2,3 cyclic , (16.25) 

and the J' {±) are the generators of the three-dimensional subgroup of 0(4). As 
(16.24) shows, they obey the same commutation relations as the angular momentum 
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components L x , L y and L z \ hence they are generators of an 0(3) subgroup. Since 
we have selected two three-dimensional subspaces of the four-dimensional space 
with the operators /' (+) and / !< \ the reduction from 0(4) to 0(3) yields six 
operators J' (±) (i = 1,2,3), instead of three. This point will later on be studied 
more rigorously in connection with the spin, but for the moment we want to get 
a more intuitive understanding of the meaning of the parameters e " ^ in the case 
of the real rotation group 0(4,/?). To do this we choose e x 2 = — s 2 i = £ and all 
other e u f, = 0. The components of the transformation (16.9) then read: 

x /0 = x° , 
x n = x 1 + ex 2 , 

x' 2 = x 2 — ex x , 

x / 3 =x 3 . (16.26) 


In fact this describes a rotation about the infinitesimal angle e within the (x 1 - x 2 ) 
plane, i.e. e 1 ? represents a rotation angle; hence (/" /J ) (12) is the operator which 
effects infinitesimal rotations in the (x 1 — x 2 ) plane. Therefore we can say, in 
general: e^ is the rotation angle about two axes 2 x c and x e which are perpendicular 
to the pu plane and, in addition, the pv plane remains unchanged by this rotation. 
All infinitesimal rotations of the group 0(4) are rotations in the six planes: 


XiX 2 , X 2 X 3 , X3X1, X1X0, X 2 X<), X3X0, 


The situation is different for the Lorentz group L of the Minkowski space (x 0 = ict ). 
Here £ 12 , £23 and £31 are real and have the same meaning as in the case of 0(4, /?); 
but £10 and £01 are imaginary (£10 = — £01 = is): if all other £ MI/ vanish, we have I 

,0 


*'° = -i£X* 
x n = x 1 + i£X 
x ' 2 = x 2 , 
x / 3 =x 3 . 


X 

0 


(16.27) 


This is no rotation, but an infinitesimal special Lorentz transformation along the 
x 1 axis, which becomes clear if we look at this Lorentz transformation in more 
detail: 

jc ' 1 = 7 (xi+iV) , 


x'° = 7 i-x 1 + x°j 
where 


7 = 



- 1/2 


2 This is just a manner of speaking analogous to that describing rotations of a plane in 
E 3 . Mathematically it is not correct only that causes a rotation of the plane spanned 
by the fi and the u axes into itself. 
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In the case of the two coordinate systems v/c = e ■ Cl this yields the result 
(16.27); hence £\q represents the ratio of the relative velocity and the velocity of 
light, and (7%) (10) is the operator which effects a special Lorentz transformation 
along the x 1 axis (, Lorentz boost). The parameters e 20 and e 30 and the operators 
(Z%) <20) and (Z"^ 30 ' have analogous meanings. 

By multiple application of the infinitesimal rotation we can perform a rotation 
about the finite angle ei 2 : 


x 


tv 


X 


w 



(16.28) 


i. e. in the case ± 0 and all other e jiv = 0, we have 

x ,v = expje 12 (/ t ' At ) (12) Jx M . (16.29) 

Here we should note that there exist transformations in 0(4) which cannot be 
achieved by infinitesimal transformations (i.e., the space and time inversions). 
Such examples will be discussed later on. 


Remark. If all group elements can be created from the unit element by infinitesimal 
continuous variation of the parameters, then the group is called connected [0(4) is 
not connected!]. 


16.3 Classification of the Subgroups of 0(4) 

Now we want to study the properties of the group 0(4) more rigorously. To that 
end we consider an arbitrary transformation a = [a^ u ]. The orthogonality relations 
read 

a n a> *a = S u a (16.30) 

or, in abbreviated notation, 

aa = t , (16.31) 

where ~ denotes the transposed matrix. Therefore, for the determinants it holds 

that 

det (aa) — deta ■ deta = (deta) 2 = 1 , 

which implies 

det a = ±1 (16.32) 

Transformations with determinants ±1 are called unimodular. the group 0(4) and 
its subgroups are therefore unimodular. By classifying the elements into those with 
det <3 = +1 and det a = -1, respectively, 0(4) is divided into two parts. First 
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we consider the group 0(4,/?): The part with det a = +1 is denoted by S0(4 ,/?) 
(SO for special orthogonal transformations). Besides the identity transformation, 
it contains all infinitesimal transformations (det (8"p, + e u M ) = +1), i.e. all finite 
transformations which can be assembled by these infinitesimal transformations be- 
long to the group S0(4 ,/?) (proper four-dimensional rotations). A typical member 
of the other part of 0(4,/?) with det a = — 1 is the coordinate inversion 

X ,k = -x k , x , 0 =x° (* = 1,2,3) . (16.33) 

Since 

/I 0 0 0\ 

„ _ [ 0 -1 0 0 I 

a M - I 0 0-1 0 ’ 

Vo 0 0 -\) 


this transformation belongs to the group 0(4,/?) (all are real) and it cannot 
be decomposed into infinitesimal transformations. Furthermore this second part 
of 0(4,/?) is no group, because it does not contain the identity transformation; 
but, together with the S0(4,/?), it does make up the whole group 0(4,/?)! [These 
statements are also valid for the 0(3,/?).] To study the properties of the Lorentz 
group L in more detail, the situation is more complicated because, in addition to 
det a = ±1, we find with the orthogonality relations that 


o = a k °a k 0 + ao°a 0 0 = 1 • 

Since a k o is imaginary, it holds that a k °a k o = — \a°\ 2 and consequent! 
(«o°) = 1 + |tf*°| 2 > 1 i 


l. e. 


a 0 ° >1 or ao° < — 1 . 

Thus the Lorentz group can be split up into four parts, namely: 
Part I or L} + : det a = + 1 , > 1 (orthochronous LT) 

Part II or Li : deta = — 1, Oo° > 1 (orthochronous LT) 

Part III or Li : deta = -1, oo° < -1 (antichronous LT) 

Part IV or L+ : det a = +1, ao° < — 1 (antichronous LT) 


(16.34) 


(16.35) 


Part I is named the group of proper Lorentz transformations. It includes the unit 
transformation, infinitesimal transformations and their iterations, i.e. all spatial 
rotations and the special Lorentz transformations of the Minkowski space. 

In the following we shall denote Part I by L p (p = proper). L p is a subgroup of 
L. On the contrary Part II or L}_ is no group, and a typical representative member 
is again space inversion. [Thus everything mentioned in connection with 0(4,/?) is 
valid.] Together with L p , L}_ forms the so-called full Lorentz group Lf, which is a 
subgroup of L. It includes the unit transformation, space rotations, special Lorentz 
transformations, space inversion as well as an combination of all members. 
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The basic element of Part III is time reversal 


x" = x' ( i = 1, 2, 3) , 


© 

II 

1 

* 

o 





that is 


/- 1 

0 

0 

°\ 

1/ 

0 

1 

0 

0 

a — 

° 

0 

1 

0 


V 0 

0 

0 

\) 

det a v ^ = 

-1 , 

a 

0 

o - 

-1 


(16.36) 


However, Li does not form a group because this part does not include the unit 
transformation. The same is true for Part IV, one example being the total inversion 
of Minkowski space: 


x m = _ x » (n = 0, 1,2,3) . (16.37) 

L,[ j_ is not a group, either. 

Together with Lf, Li and L+ form the so-called extended Lorentz group. Be- 
cause Lf (full Lorentz group) does not change the sign of the zero component of a 
time-like vector, it is frequently denoted orthochronous Lorentz group. 


16.4 The Inhomogeneous Lorentz Group 

The inhomogeneous Lorentz group L keeps the distance between two points of the 
Minkowski space invariant, and the transformation of points of the Minkowski 
space by L has been given by x' v — a v ^ . We now drop the requirement of 
homogeneity and use the transformation 

x' v = a u „x^ + 0 V . (16.38) 

It can be seen that the term /3 l/ cancels in the expression for the distance be- 
tween two points, i. e. (16.38) also keeps the distance between two points invariant, 
whereby the orthogonality relation = b„ v holds once more. [Of course, 

the transformations (16.38) do not keep the length of x M invariant.] Furthermore 
/3° has to be imaginary and the /)' (i = 1, 2, 3) have to be real and obviously the 
/?" characterize space-time translations. The transformations (16.38) form a group, 
the so-called inhomogeneous Lorentz group or Poincare group P. 

Like the homogeneous Lorentz group, P also disintegrates into four parts, e. g. 
the proper inhomogeneous Lorentz group [the group theoretical expression: SO(4) 
corresponds to ISO(4); I for “inhomogeneous”]. 

P has ten parameters (six independent a v M and the four components of f3 u ). The 
subgroups of P are the homogeneous Lorentz group L and the four-dimensional 
group of translations S, whose elements are given by (x L ') / = x v + (3 V . Because the 
transformations do not commute with the homogeneous Lorentz transformations, 
the group P is not the direct product of L and S, i. e. 


a v nix* + P) ± a v ^ 


(16.39) 
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We now consider the translational group and first look for the infinitesimal gener- 
ators of the transformation P „ (o = 0, 1,2,3). The infinitesimal transformation is 
analogous to (11): 


= (l + ~fr £e Pej ■ (16.40) 

The e 8 are the infinitesimal transformation, i.e. x' e = x 8 + e 8 (|e e | <C 1), though 
because of (16.12) we have 


ipx ' 8 = ip(x e + e 8 ) — i{j(x 8 ) + £ 8 ^r~ 

OX 8 

Comparison of (16.40) and (16.41) yields 


P e = — i h 


d 

dx 8 


(16.41) 


(16.42) 


which are the generators of the infinitesimal translations. As in the case of non- 
relativistic quantum mechanics, the momentum operator is connected with space 
(time) translations. Each finite translation can be generated by iterative application 
of (16.42), i.e. according to (16.28) and (16.29) the unitary operator 


U = exp 



(16.43) 


is the relativistic translation operator. 

The translational group S is an abelian group, because 

[Pp,P v \=0 ■ (16.44) 


However, P e does not commute with the generators (/ ty M ) ( ' <T) (16.13) of the ho- 
mogeneous Lorentz group and therefore does not commute with the invariants 
constructed with J i(+) and J l( ~\ though it is possible to show that the operators 

Pl=P ^ and W x = l -Y j e Xtlue P^ (16.45) 

commute with all and P Qy i.e. P ^ and W\ are the invariants (Casimir op- 
erators) of the Poincare group. For a free particle we have = m^c 2 , the 

particle’s mass, and the operators W\ are connected with the spin of the particle, 
as will be seen later. Obviously the Casimir operators provide all quantities (mass, 
spin) necessary for the description of the free particle, which is the reason for the 
great importance of this group in modern elementary particle physics. 
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16.5 The Conformal Group 


In the preceding section we have seen that the generators of the physically impor- 
tant Poincare group are given by the six generators of the homogeneous Lorentz 
group (I u n) {ecT) and the four generators of the space-time translations. The 
Poincare group can be extended to the 15-parametric conformal group of the four- 
dimensional Minkowski space, by considering transformations that keep the light- 
like line element ds 2 — (dx 0 ) 2 - (dx 1 ) 2 - (dx 2 ) 2 - (dx 3 ) 2 = 0 invariant. That is, the 
conformal transformations include changes of length, but keep the angle between 
two vectors invariant (hence “conformal mapping”); thus we have to add five more 
generators to the ten generators of the Poincare group. The first operator is that of 
scale transformations or dilatation D and it yields the transformation 


x ,/x = qx» {q > 0) . (16.46) 

The four remaining generators create the so-called proper ( special ) confor- 
mal transformations , which change the length scale point by point, thus forming 
position-dependent dilatations. These transformations can be written as a product 
of an inversion I \ : 






a translation t : 


x "»= x *- a » 


and a further inversion I 2 . 
k 2 




(x’jx'y 


.Hfl 


(16.47) 


All together, this yields the special conformal transformation K —IfTI 1 : 
x»-a»x 2 


r'M = 


o(x) 


with 


a(x) = 1 — 2 a u x u + a 2 x 2 and 


(16.48) 


a 2 = a^a” 


2 V 

X — x u x 


For the conformal group we get a total of fifteen infinitesimal generators: 


■ six generators of the Lorentz group j generators of the 

Pfj, : four generators of the translational group f Poincare group 
D : one generator of dilatations 
K M : four generators of special conformal transformations. 
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It is possible to give an explicit representation of D and K^, by 

d 


D = ix 1 ' 


dx v 


O 

Ka = i I IXaX 1 ' — 

m M dx u 


— X 


d 
dx p 


(16.49) 


Furthermore, one may calculate the commutators of D and with the remaining 
generators of the conformal group by help of the explicit representations (16.16), 
(16.20), (16.42) and (16.49): We find 


[W] 

— i Pfi > 

[P»,K\ 

= 2i 


= 0 , 

V>X] 

= , 

[K^K] 

= 0 , 


= i (g eX K a - : 


(16.50) 


The importance of the conformal group is based on the fact that, e. g., the Maxwell 
equations without sources are not only Lorentz invariant but also conformal in- 
variant! However problems do occur in the discussion of the Maxwell equations 
with source terms or of equations of motion for massive particles: Because of 
m\c 2 = P^P M and 


A6D 


P M P M e 


-i 6D 


= e 2(? P M P M 


(16.51) 


the rest mass is not invariant with respect to dilatations, i. e. equations of motion 
like the Dirac equation including a mass term are not conformal invariant. An 
exact symmetry of dilatation is thus only possible for a continuous mass spectrum 
or vanishing mass. 


exercise 

16.1 Transformation Relations of the Rest Mass Under Dilatations 
Problem. Show the validity of (16.51) with the help of (16.50)! 

Solution. Use [P^, D] = iP M . For any given A and B the Champbell-Hausdorf 
relation holds: 

e~ A Be A = B + 1[B, A] 4- ^ [[B, A], A] + . . . (1) 

In our case we have A — — i OD and B — so that 

[B,A] = -i e[P^,b] =-ie{[p lM D]P» + P tl [P t *,D}} 

= -i e {i + i P m P m } = 2 eP^P^ = 20B 


( 2 ) 
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Exercise 16.1. 


i. e. the multiple commutators in (1) always reduce to [B, A] = 2 OB. Thus it is 
possible to write down the series in a closed form and perform the summation 

jebp^p„ t -ieb = p^p„ + 2d p^pn + 1(2 efP^P* -f . . . 

= P fl P^ l + ^ + I(20) 2 + ..^ 

= e 2e P fJ ,P IJ ' . 


16.6 Representations of the Four-Dimensional Orthogonal 
Group and Its Subgroups 

In this section we consider only the homogeneous proper groups. 


16.6.1 Tensor Representation of the Proper Groups 

We consider a linear space of dimension 4 n . One element or “vector” of this 
space has 4 n components, which is denoted by with indices equal 

to 0, 1, 2, 3. In the case of the group SO(4 ,/?) all components are real; however for 
L p all components including an odd number of indices “0” are imaginary, all the 
others being real. [This will become clear at once by taking into account the law 
of transformation (16.52).] First we look at a proper transformation x v — > x w — 
a y of SO(4 ,/?) or L p . Here xfj should transform like 

't/Vi. ../X„ ~ ~ a P L l 1(3 M2 2 • • * a tl n l ' n '4 ) V\.. .V n } (16.52) 

and the identity transformation reads 

'&pL 1 ...pL n ~ 1( ^M2 2 * * * n ^l ~Vn * (16.53) 

Because of the orthogonality relations for a ^ the transformations (16.52) form a 
linear transformation group in 4" dimensions; thus we have found a connection 
between the group SO(4 ,/?) (of L p ) and the linear transformation group in 4" 
dimensions (mathematically: an isomorphism). 

Hence one says: The transformations (16.52) form a 4 n -dimensional represen- 
tation of the group SO(4 ,/?) (or L p ). The elements of the 4 n -dimensional repre- 
sentation space we shall name tensors of rank n , and the tensor transformation has 
the form 

Vv-. e ->■ . . . a £ w Vv...w • (16.54) 

A tensor of rank 2 accordingly transforms like 




(16.55) 
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The most simple representation is the one without indices, the scalar representation , 
where the transformation obeys 

*0 — » ip f = -0 . (16.56) 

The next possibility is for n = 1, i. e. in 4 dimensions: 

3 

Vv ^ = • (16.57) 

i /=0 

This is the vector representation. Since the group SO(4, R) is four dimensional, 
the Vv are just the self-representation of the group: we denote the quantities as 
four-vectors. The next representation is the tensor representation of rank 2, which 
is 4 2 = 16 dimensional. The behaviour of this transformation follows 

i/V = a^a^ipev . (16.58) 

We can also interpret this as follows: The tensors i [) M „ can be written as a 16- 
component vector in representation space, using the transformation law (16.58), 
as 


$,=Ww (M ,N = 1, . . . , 16) , (16.59) 

whereby the 16x16 matrix An m is built up of the corresponding Symbolically 

we can write this as A = a ®a. The components of a tensor of rank 2 thus transform 
like the product of two vector components. In general, tensors of rank n transform 
with the 4" x 4" matrix: 

A — a ® a <g> a <S> . . . ® a , 

' V ' 

n factors 

i. e. the components transform like the product of n vector components. Hence we 
conclude: 

All tensor representations of the groups 50(4, R ) and L p can be reduced to the vector 
representation (they are reducible). Only the scalar and the vector representation 
are irreducible. 


16.6.2 Spinor Representations 


First we consider the representations of the group SU(2, C), i.e. the group of 
two-dimensional unitary, unimodular transformations: 


u[ = au\ + bui , = cu i + d u 2 


Because of unimodularity 


detM = 


a 

c 


b 

d 


= ad — be = 1 


(16.60) 


(16.61) 


and because of unitary 
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must hold; therefore d = a* and c = -b*, i. e. 


( u \ [\ = ( a b\(u , 

\u'J \-b* a*){u 2 
and (16.61) becomes 

aa* + bb* = \a\ 2 + \b\ 2 = 1 . 

The group elements are the unitary, unimodular matrices 


(16.62) 


(16.63) 


(16.64) 


(16.65) 


and the group has three real parameters: the complex numbers a and b minus the 
condition (16.64). These are called the Cayley— Klein parameters. 

The elements of the complex linear space which have the transformation be- 
haviour (16.63) are called elementary (or two-component) spinors of the three- 
dimensional space [SU(2) is an isomorph to SO(3. R)]. 

We consider the linear space of monomials (a monomial of degree v is an 
expression of the form: x a y b . . . z c , where a + b + . . . c = v) of degree v: 

P k = , (16.66) 

(e.g. P 0 = U] , P\ = u\~ x u 2 , etc.), where v and k are integers and 0 < k < v. 
For a given v there are v + 1 monomials of the form (16.66), i.e. the space is 
(v + 1) dimensional. It is also the desired representation space of the group SU(2). 
Applying the transformation M on P k , one notes the following: 

1. Because of the linearity of the transformations, application of the M results 
once more in P k s, or linear combinations of P k s. 

2. The product of two SU(2) transformations has the same effect on P k as the 
single transformations applied to P k one after another. 

3. The identity transformation of M with a = 1 and b = 0 corresponds to the 
identity transformation P k — > P k . in representation space. To the matrix M 
corresponds a (v + 1) x (t> + 1) matrix in representation space. To find this 
matrix, we apply M on P k : 

Pk \ = ( au i + bu 2 ) v ~ k {—b*u\ + a*u 2 ) k (16.67) 


Obviously it transforms according to 

V V 

— '^DyPi (16.68) 

1=0 1=0 

after ordering according to monomials of P k in (16.67). The matrix elements D u 
of the matrix D [a (v + 1) x (v + 1) matrix] depend on the parameters a and b of 
the transformation in SU(2). 
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It is advantageous to introduce the abbreviation j = v/2, whereby j = 0, 
It is advantagous to introduce the abbreviation j = v/2, whereby j - 0, 1/2, 
1 , . . . can assume all integer or half-integer values. We denote the representation 
which we have constructed this way by O' , where Dj = {. . . IP (M ) . . . } is the set 
of (2 j + 1) x (2j + l)-dimensional matrices which depend on the parameters of the 
SU(2) transformations M . The corresponding elements of the (2 j + l)-dimensional 
representation space are called spinors of order 2 j of the three-dimensional real 
space. 

The most simple representation is D 0 : All elements M of SU(2) have as their 
image the number 1, and the representation D 1 / 2 is SU(2) itself: P\ = «i, Pi = u 2 
are the two components of the elements of representation space, higher represen- 
tations being obtained analogously. 


EXERCISE 

16.2 D 1 Representation of SU(2) 

Problem. What does the D X (M) representation of SU(2) look like? Give the matrix 
D X (M) in the case of 



Solution. D 1 operates on three-dimensional space since there are three components 
of representation space: 

P 0 = H 2 , Pi = UiU 2 , Pi = «2 ) 

and, additionally, v = 2 j = 2. With (16.67) and (16.68), it follows that 

2 

P % = "y ^ D/ciP i = ( au i + bui) v k (— b*u\ -I- a*ui) , 

1=0 


Po = DooUi + DoiUiUi + Z) 0 2«2 

= ( au\ + bui) 2 = « 2 m 2 + 2abu\Ui + b 2 u 2 , 

P[ — D\qU 2 + D 11 U 1 U 2 + O 12 M 2 
= ( au\ + bu 2 ) (—b*ui + a*u 2 ) 

= —ab*u 2 + (aa* — bb*) u\u 2 + a*bii 2 , 

P 2 = Diou 2 + D 2 \U\u 2 + D 22 ul 

— ( -b*u\ -I- a*u 2 ) 2 = b* 2 u\ - 2a*b*U\U 2 + a* 2 ul 
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Exercise 16.2. 


Comparison of coefficients with 

o\ / Ax> Do, Dq 2 \ / Po\ 

P,' = D, 0 D n D n />, yields 

\P'/ \D 20 d 21 D 22 J \P 2 J 

/ a 2 lab b 2 \ 

D\M)=l-ab* ( aa*-bb *) a*b ) 

V b* 2 -2 a*b* a* 2 ) 

One can now show in a lengthly algebraic proof that the representations D’ : 

(a) are irreducible, i. e. there is no representation D 1 ' which could be built up from 
Dh, with j 2 < j \ ; and 

(b) the D n represent all possible representations of U (2). 


16.7 Representation of SL(2, C) 


The group SL(2, C) is the group of the linear complex 2x2 matrices with the 
determinant +1. We consider now the four-dimensional unimodular group SL(2, C). 
The transformations are, analogously to SU(2), given by 


u[ = aii\ + bu 2 , u' 2 = cm, + du 2 


The only restriction now is the unimodularity 


det M = 


a 

c 


b 

d 


— ad — be = 1 . 


(16.69a) 


(16.69b) 


[The unitarity drops in the case of SL(2, C).] 

Since SL(2, C) is isomorphic to Lp, we denote the elements u = (m, , u 2 ) of 
SL(2, C) as two-component spinors of Minkowski space. 

The complex conjugate transformation to (16.69a) reads 


w{ = a*u\ + b*u 2 , u' 2 = c*U[ + d*u 2 


with 

detM = 


b* 

d* 


= a*d* - b*c* = 1 


(16.70a) 

(16.70b) 


The dot on the components (e. g. mj , u 2 etc.) denotes complex conjugation. Because 
of the independence of a and a* etc., the elements it = (mj , u 2 ) behave differently 
under transformations to the elements u, i. e. the group has six real parameters 
[between the four complex coefficients a, b,c,d there exists only the one complex 
relation (16.69b)]. Thus the representations split into two groups: The spinors u, 
to which _belong all matrices M , and the dotted spinors u, to which belong the 
matrices M . Correspondingly we construct the monomials analogous to P k in the 
spinors of degree v and in the dotted spinors of degree v' as 

n y — k W — k' k 

* kk f — It j ^2 


(16.71) 


16.8 Representations of SO(3 ,R) 


407 


Altogether the monomial (16.71) has the degree v + v\ e. g. one has 

Poo — , Pol — l U2 , 

P\o = u\~ x u\u 2 , Pn = u\~ l u\ ~ x u 2 U2 , 

and since it, it' are integers, then 0 < k < v and 0 < k' < v' . For fixed v and v' 
there are altogether ( v + l)(i/ + 1) monomials, i. e. the representation space has the 
dimension (v + 1)(V + 1) and the monomials (16.71) span this representation space. 
Now we again introduce j = v/2 and j = v' / 2 and denote the representations by 
D jj ' , where;,;' = 0, 1/2, 1, 3/2, 2.... The representation has the dimension 
(2 j + 1)(2 ;' -f 1) and its elements are called spinors of rank (2 j + 1 )(2 ; ' + 1 ) of 
Minkowski space. Of course, the representations are not unitary. Now we consider 
the matrix form of the representation D 77 = {. . . D 1 ’ (M,M )...}, whose most 
simple representation is D 00 , which is one-dimensional and yields the image 1 for 
every element. The next representation is D 2 °, which is the self-representation of 
SL(2, C) by the spinors u\ and u 2 . The four-dimensional representation is based 
on the elements 


Poo = M] Mj , P 0 ] = U\U 2 , Pm = U 2 u\ , Pll = M 2«2 • 


With the aid of the unimodularity relation (16.69b) one obtains analogously, for 
example for SU(2), the image of the transformation M of SL(2, C): 


DH(M,M) 


/ aa* 

ab* 

ba* 

bb 

' ac* 

ad* 

be* 

bd 

i ca* 

cb* 

da* 

db 

\ cc* 

cd* 

dc* 

dd 


(16.72) 


The D ij ' represent all irreducible spinor representations of SL(2,C) and thus also 
of L p [because of the isomorphism of SL(2, C) and Lp]. 


16.8 Representations of SO(3, R) 

The three-dimensional rotation group SO(3 ,R) is isomorphous to the group SU(2). 
We have seen that all irreducible representations of SU(2) are spinor representa- 
tions, and therefore this holds for SO(3. R), too. We have already constructed the 
tensor representation to SO(3. R): we now show that the tensor representations are 
included in the representations D j , starting with D x . Therefore we transform the 
basis in representation space {Po,P\,Pi) with the help of the matrix 

/-I 0 1 

t = I -i 0 -i 

V 0 2 0 


V’i = —Po + Pi 
ip 2 = -iP 0 - i P 2 
ih = -2Pi 


into 


xjt - TP = 


(16.73) 
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Because of P' = D X {M)P [we have calculated D l (M) in Exercise 16.2], it holds 
that 


0 ' = tb\M)f- x xi) , 


where 

D x {M)t~ x 

I / a 2 - b 2 - b* 2 + a* 2 i (a* 2 + b* 2 - a 2 - b 2 ) -2 (ab + a*b*)\ 

= - i (a 2 - b 2 - a* 2 + b* 2 ) a 2 + b 2 - a* 2 - b* 2 -2i (ab-a*b*)\ . 

\ 2 (ab*+a*b) 2i (a*b - ab*) 2 (aa* - bb*) ) 

Because of aa* + bb* = 1, one directky obtains 

ip' ip' = ipiipj = invariant . 

Furthermore det (tb x T~ l ) = detD 1 = +1 and all matrix elements are real. The 
transformation is therefore identical to the vector transformation ip t — > ip[ = at k %p k . 
Hence it follows that D 1 is identical to the vector representation. So one can show 
that (we don’t want to give the proof here): 

All representations D> , where j is integer, are tensor representations, while all 
representations with half-integer j are proper spinor representations. 

Now we turn to the Lorentz group. 


16.9 Representations of the Lorentz Group L p 


The proper Lorentz group L p is isomorphic to the group SL(2, C); therefore the 
spinor representations D ij ' are all the irreducible representations of L p . First we 
consider the representation D^, which is based on: P m = w,Wj, P o] = mu^ and 
Pi i = m 2 « 2 > these transforming with the matrix Dh(M,F). Because of ad -be — 
1 and a*d* — b*c* = 1, it follows that 


PwP'u+PmPlo--PooPu+PoiPu> ■ (16.74) 

With the coordinate transformation ip = X/s/ltP or P = s/2T~ x ip, where 


T = 



°\ 

0 

-1 

i J 


(16.75) 


we now introduce new axes in P space. This transformation reads explicitly: 
ipi = j (P\o + Foi) , or Poo = ip3 - iip 4 , 

1 p 2 = Yi (P™ - Poi) , Foi = Ipl - ilp 2 , 

= \ (Pm -P\\) , P\a = V , 1+ i02 , 

04 = -^(Foo + Pll) , P\\-- 1 pz-\ 1 p 4 , 


(16.76) 
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and one obtains 




~7=77 >/ = 

V2 


Tb'i*(MM)P = Tbtt(MM)T~ l il> 


V2 


(16.77) 


If we insert the P,* (i,k = 0, 1) from (16.76) in (16.74), we see that the transfor- 
mation (16.77) keeps 


'I’M = 

invariant. 

Because the dot in Poo = 11 \ ll \ etc. denotes complex conjugation, oer recognizes 
from (16.77) that ipt, ^ 2 , ^3 are real and V>4 is complex. Furthermore 

det (Tb^f- 1 } = detZ)^ — detM x detM = +1 and 


(tbttf - | (|a| 2 + \b\ 2 + \c\ z + \d\ 2 ) > 1 , 

because of unimodularity. The transformation 77) Hp -1 transforms the W * n 
a manner similar to a four-vector in a proper Lorentz transformation, i. e. the 
representation D H is just the vector representation. The representation of the vector 
by the spinor components P is given in (16.76). 

We now show a relation between M from SL(2, C) and an element of L p , 
taking as an example a rotation around the z axis: 

= x\ cos <p + X 2 sin <p , 

*2 = — *1 sin tp + xi cos <p , 

*3 = *3 , 

x'=x 0 (16.78) 

or, equivalently, 

- ixo = X 3 - ix 0 , 
x{ — 1 x 2 = e 1(p (xj — 1x2) , 

x,' + ixj = t~ iip (xi + ix 2 ) , 

— X3 — ixo = — X3 — ixo . (16.79) 

If we compare this with (16.76), it follows for the spinor components that 
Poo = Poo , / > ,'o = e i ^P 10 , 

P',=e^P 01 , P[\ = P\\ • (16.80) 

Comparing this with the general transformation (16.72) we find: 

aa* = 1 , ad* = , da* = e~ iv , dd* = 1 

bb* = 0 , cc* = 0 , etc. 

The solution of these equations is 

a = ±t vp / 2 5 b = 0 , c = 0 , d = ±e _iip ( 2 . 
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The rotation around the z axis by the angle </? corresponds in SL(2, C) or in d'+ 
to the transformation 

M=D-*\M,M) = ±^ 0 e _^ /2 j or (16.81a) 

* j A ^ /p— iy?/2 n \ 

M =D°HM,M) = ±{^ Q J} /2 j , (16.81b) 

respectively, for the dotted spinors. 

In analogy to this one can show, e. g., that the transformation 

A I =b‘AM,W) = ±(l ,® 7 ) 

corresponds to a special Lorentz transformation of the z axis, where 


7 


2 


1-0 

l+P 


V 

and 0 = - 
c 


is valid. Therefore the representations d'+ and D 0 ^ are not tensor representations, 
but proper spinor representations. In general one finds for D 1 *' that 

(j + j 1 ) integer => D ^ is a tensor representation, 

(j +j ') half integer => .D / ■ , is a proper spinor representation. 


16.10 Spin and the Rotation Group 

We consider a field ipc+x^) whose components should have a distinct property 
of transformation, i.e. ip a (x fl ) is a member of a representation space of the 
proper Lorentz group Lp. If we perform an infinitesimal Lorentz transformation, 

then the change in ipaix^) is given with the help of the generators (l v ,0f ae) of the 
transformation (16.16) and the infinitesimal transformation e ae by 

= ipM) - ip a (x T ) = \e ce (l 0 a ) i<Te) ip 0 (x T ) (16.82) 

or, in matrix form, 

= ip'(x') - ip(x) = \£ ae I ((Te) ip(x) . (16.83) 

On the other hand we can only consider the local variation of ip, not of x ; then, 
we have 

8*ip = ip' {x) — ip(x) . 

The quantities Sip and 8* ip are not independent, because x' T = x T + e T a x a and 
therefore 

8* ip = ip' ( x' T — e T a x a ) — 1 p{x) . 


(16.84) 
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Now we evaluate this expression in a Taylor series and neglect terms of second 
order, which yields 

8*xp = xp' (x' T — ip(x T ) — e T c x c d T ) xp 

= 6xp — e T a x a d T xp (16.85) 

and therefore 

6*xp = j£ ae I i<re) xp - e 1 ' ^x^d^xp . (16.86) 

As we sum over a, g we relabel the indices: 

6*xP = - e^x^) xl> . (16.87) 

Rewriting the second term: 

£ n u fM x f *d l/ xP = \e\ [( jc " 0 „ + x v d») + (X»d u - X„d»)} xp 

= \£\(x»d v -x u d»)xp , (16.88) 

then, because of the antisymmetry of e v At , it results that 

6*xP = I [-£% {X»d v - x u d») + ip . 

Hence, for the space-like components ( i,k = 1,2,3) one gets 

6*xp = I [-£*,. {x‘d k - x k &) + £*/<*>] xp . (16.89) 

Up to the factor i/ft the first expression in brackets is just the angular momentum 
operator 

L i k = ^(x i d k -x k d i ) . (16.90) 

The second term, i. e. (i/ft)/ ( '*\ is independent of the coordinates and therefore 
also of the coordinate system; thus it is straightforward to interpret this term as an 
“ inner” angular momentum, i. e. as a spin. With the help of the generators we see 
that [ see (16.21-16.23)] 

l v 

s 2 = -J^ = h(P^ + J 2 ^) , 

S 3 = ^/< 21 > = ft (y 3 <+) + / 3 <->) , (16.91) 

Now, the commutation relations of the 5, are a direct consequence of the commu- 
tation relations of the 7' (±) and thus of the generators of the infinitesimal Lorentz 
transformation, i. e. 

S\§2 — S 2 S 1 — iftS 3 (16.92) 

and cyclic commutations therefrom. 

Consequently the operators 5, have all the properties of an angular momentum 
operator: thus for a given integer or half-integer s, S3 has the (2s + 1) eigenvalues 
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(— 5 , — s + 1, . . s — 1, s). Furthermore there exists a Casimir operator S 2 = S 2 , 

with the eigenvalues h 2 s(s + 1). 

Using these considerations, we define the operator of the total angular momen- 
tum to be 

M ik = Y (x‘& - X k d i ) + = L + S , (16.93) 

and hence we get from (16.89) that 


8*xp = \e ik ^M ik xP . (16.94) 

Thus the coefficient of the local variation of the field defines the operator of the 
total angular momentum, and we conclude: To determine the spin of the field 
ip(x) we must find infinitesimal operators generating space-like rotations in the 
representation space which is determined by the transformation properties 
of ip. Let us now only consider the subgroup of the Lorentz group L p which 
characterizes the space-like rotations. Furthermore we can then conclude: In this 
case also, the matrices M from SL(2, C), which belong to space-like rotations, are 
unitary [not only unimodular, and thus they do not belong to SU(2)] . To understand 
the consequences we define the direct product of two matrices A and B as 

C = A® B (16.95) 

or, in matrix notation, 

Aa’yBp/j . (16.96) 


This definition becomes clearer in the following example: 


A®B 


^11 «12 
«21 «22 


b\] b\2 
bi\ b^i 


/ a \\b\\ 
a\\b2\ 
®2\b\\ 

V 021^21 

(a n B 

[ci2iB 


anb\2 
Club22 
U2\b\2 
<*2\b22 
a 12 B\ 
«22^ ) 


anbn 

anb2\ 

a 22b\\ 

«22^21 


anb\2 \ 
anb22 
<*22^12 
^22^22 / 


(16.97) 


Of course this operation is not commutative, i. e. 


B ®A = 


b\\A b\2A 
b2\A ^ 22-4 


+ A®B 


With the help of definition (16.97), one can easily derive the following properties: 

1. The direct product of two diagonal matrices is diagonal. 

2. (A + B)® C -f ( A ® C') T ( B ® (7). 

3. If A, A' are n x n matrices and B, B m x m matrices, then it holds that 
(A ® B){A' ® B') =AA'® BB'. 

4. If A and B are unitary, then A® B is also unitary 

5. tr(A ® B) = trA trfi. 
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With the help of these definitions we are now able to rewrite the representation 
(16.72) as 




/ aa * 

ab* 

ba* 

bb 

1 ac* 

ad* 

be* 

bd 

ca* 

cb* 

da* 

db 

\cc* 

cd* 

dc* 

dd 


a b 
c d 


a* b*\ 
c* d* ) ' 

(16.98) 


From this example one can conclude that this representation can be described as a 
direct product, i. e. for the matrices we have 

b ii \M,M) = b i {M)®D i '(M) . (16.99) 

The matrices D j (M) and D J ' (M ) are not identical with the matrices of the rep- 
resentation D>, D> of the SU(2)! Of course this is only the case if the matrix 
M is unitary. But this is just the case for space-like rotations. Thus for the total 
representation space we can write: 

D' j '=D i ®D i ' , (16.100) 


i. e. for space-like rotations the representation D 11 of the Lorentz group L p splits 
up into a direct product of the representations D 1 and D 1 of the three-dimensional 
rotation group, each D 1 representing a different spin. Therefore, in general a co- 
variant Lorentz field tf> a (x M ) does not possess unique spin representations, but splits 

up into different spin representations D 1 , D 1 . 

First we consider the representation D 1 . As we have seen it is (2 j + 1) di- 
mensional. In this space the infinitesimal operators which determine the spin are 
(2 j + 1) x (2j + 1) matrices and consequently have (2 j + 1) eigenvalues. Degeneracy 
does not appear, because the quantities spanning up the representation space are 
linearly independent. But we have seen that these are the 25 + 1 eigenvalues of 
the spin, i.e. the spin is given by the index j of the representation D 1 . For example, 
y=0 belongs to the scalar representation D° describing spinless fields. For the 
vector representation D x the spin has the value 1. This becomes clear with help 
of the infinitesimal operators: For space-like rotations we have: 

/ 0 1 OX 

/ 3 =/ (2 *)= -10 0 . 

V 0 0 0/ 


exercise 

16.3 Vector Representation and Spin 

Problem. Show that spin 1 belongs to the vector representation D ] . 

Solution. The vector representation D l is just the self-representation of the space 
of three-dimensional rotations, i. e. the spin values are given by the eigenvalues of 
the corresponding generators of the Lorentz group. For three-dimensional rotations 
this is 
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Exercise 16.3. 


f 0 1 0\ 

p =/( 21 ) =1-1 0 0 I 

V 0 0 0/ 

The characteristic equation reads 


det(/ 3 - AH) = 0 , i.e. 


-A 

-1 

0 


1 0 
-A 0 
0 -A 


= 0 = — A(1 + A 2 ) 


hence A = 0, ±i. Because s = (h/ i)A, one gets s = (— 1,0, l)h. 


For the representation D 2 , one finds analogously, s = h/2, etc. Without proof 
we note that (16.100) can be written as a sum with the help of a generalized 
Clebsch-Gordan theorem: 

jyi' =D> ® D' = D> +j ' 0 0 . . . 0 &->' . (16.101) 

Therefore we have, for example, 

D 2 ° = d' 2 ®D° = d' 2 , 


because 


D 2 = ± 


e-iv/ 2 o 
0 e iv> / 2 


D 2 is a self-representation of SU(2) (i. e. of all unitary, unimodular matrices), and 
indeed, D° 2 is a matrix of SU(2) (Dp. 

From (16.101) it follows that: the spin which belongs to the representation D lj 
is 


half integer, if j + j' is half-integer 
integer, if j +j' is integer . 

However, this means: the tensor representation belongs to the integer spins, the 
spinor representation to the half-integer spins. 

The described representation is not unique, which is illustrated in the following 
example: One has 

D' 2 ' 2 = D 2 ® D 2 = D x ® D° . (16.102) 

According to our rules D ^ describes spin-1 fields. However, we see that (16.102) 
also contains scalar parts, which do not lead to spin values s = 1 . To get a unique 
representation one has to introduce auxiliary fields, which just eliminate this scalar 
part ( D ° ). Such auxiliary conditions play an important role in the quantum theory 
of the Maxwell field. 
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